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Preface 


This book contains an introduction to mathematical proofs, including fundamental material 
on logic, proof methods, set theory, number theory, relations, functions, cardinality, and the 
real number system. The book can serve as the main text for a proofs course taken by 
undergraduate mathematics majors. No specific prerequisites are needed beyond familiarity 
with high school algebra. Most readers are likely to be college sophomores or juniors who 
have taken calculus and perhaps some linear algebra, but we do not assume any knowledge 
of these subjects. Anyone interested in learning advanced mathematics could use this text 
for self-study. 


Structure of the Book 


This book evolved from classes given by the author over many years to students at the 
College of William & Mary, Virginia Tech, and the United States Naval Academy. I have 
divided the book into 8 chapters and 54 sections, including three review sections. Each 
section corresponds very closely to the material I cover in a single 50-minute lecture. Sections 
are further divided into many short subsections, so that my suggested pacing can readily 
be adapted for classes that meet for 75 minutes, 80 minutes, or other time intervals. If the 
instructor omits all sections and topics designated as “optional,” it should be just possible 
to finish all of the core material in a semester class that meets for 2250 minutes (typically 
forty-five 50-minute meetings or thirty 75-minute meetings). More suggestions for possible 
course designs appear below. 

I have tried to capture the best features of live mathematics lectures in the pages of this 
book. New material is presented to beginning students in small chunks that are easier to 
digest in a single reading or class meeting. The book maintains the friendly conversational 
style of a classroom presentation, without relinquishing the necessary level of precision and 
rigor. Throughout this text, you will find the personal pronouns “I” (the author), “you” 
(the reader), and “we” (the author and the reader, working together), reminding us that 
teaching and learning are fundamentally human activities. Teaching this material effectively 
can be as difficult as learning it, and new instructors are often unsure how much time to 
spend on the fundamentals of logic and proof techniques. The organization of this book 
shows at a glance how one experienced teacher of proofs allocates time among the various 
core topics. The text develops mathematical ideas through a continual cycle of examples, 
theorems, proofs, summaries, and reviews. A new concept may be introduced briefly via 
an example near the end of one section, then examined in detail in the next section, then 
recalled as needed in later sections. Every section ends with an immediate review of the 
key points just covered, and three review sections give detailed summaries of each major 
section of the book. The essential core material is supplemented by more advanced topics 
that appear in clearly labeled optional sections. 
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Contents of the Book 


Here is a detailed list of the topics covered in each chapter of the book: 


L. 


Logic: propositions, logical connectives (NOT, AND, OR, XOR, IF, IFF), truth ta- 
bles, logical equivalence, tautologies, contradictions, universal and existential quanti- 
fiers, translating and denying complex logical statements, uniqueness. 


. Proofs: ingredients in mathematical theories (definitions, axioms, inference rules, theo- 


rems, proofs), proof by example, direct proof, contrapositive proof, contradiction proof, 
proof by cases, generic-element proofs, proofs involving multiple quantifiers. 


. Set Theory: set operations (union, intersection, set difference), subset proofs, set equal- 


ity proofs, circle proofs, chain proofs, power sets, ordered pairs, product sets, unions and 
intersections of indexed collections. 


. Integers: recursive definitions, ordinary induction proofs, induction starting anywhere, 


backwards induction, strong induction, integer division with remainder, greatest com- 
mon divisors, Euclid’s GCD algorithm, primes, existence and uniqueness of prime fac- 
torizations. 


. Relations and Functions: relations, images, inverse of a relation, identity relation, 


composition of relations, formal definition of a function, function equality, operations 
on functions (pointwise operations, composition, restriction), direct images, preimages, 
injections, surjections, bijections, inverse functions. 


. Equivalence Relations and Partial Orders: reflexivity, symmetry, transitivity, 


equivalence relations, congruence modulo n, equivalence classes, set partitions, anti- 
symmetry, partial orders, well-ordered sets. 


Cardinality: finite sets, basic counting rules, countably infinite sets, countable sets, 
theorems on countability, uncountable sets, Cantor’s Theorem. 


. Real Numbers (Optional): ordered field axioms for R, algebraic properties, formal 


definition of N and Z and Q, ordering properties, absolute value, distance, Least Upper 
Bound Axiom and its consequences (Archimedean ordering of R, density of Q in R, 
existence of real square roots, Nested Interval Theorem). 


Possible Course Designs 


A standard three-credit (2250 minute) proofs class could cover most of the topics in Chapters 
1 through 7, which are essential for further study of advanced mathematics. When pressed 
for time, I have sometimes omitted or condensed the material on cardinality (Chapter 7) 
or prime factorizations (last half of Chapter 4). Many variations of the standard course are 
also feasible. Instructors wishing to preview ideas from abstract algebra could supplement 
the standard core with the following optional topics: 


the group axioms (end of Section 2.1); 
unique factorization properties for Z and Q (last four sections of Chapter 4); 


formal construction of the integers mod n and the rational numbers using equivalence 
relations (Section 6.6); 


algebraic properties of R developed from the ordered field axioms (Sections 8.1, 8.2, 8.3, 
and possibly 8.4). 
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A course introducing ideas from advanced calculus could include these topics: 
e how to prove statements containing multiple quantifiers (Sections 2.6 and 2.7); 
e general unions and intersections (Section 3.6); 
e properties of preimages of sets under functions (Section 5.7); 
e countable and uncountable sets (Sections 7.2, 7.3, and 7.4); 


e rigorous development of the real numbers (and related number systems) from the ordered 
field axioms (Chapter 8). 


A quarter-long (1500 minute) course focusing on basic proof methods might only cover 
Chapter 1, Chapter 2, and the early sections in Chapters 3 through 6. A quarter course on 
set theory, aimed at students with some prior familiarity with logic and proofs, might cover 
all of Chapters 3, 5, 6, and 7. 

Topics can also be studied in several different orders. Chapter 1 on logic must come 
first, and Chapter 2 on proof methods must come second. Thereafter, some flexibility is 
possible. Chapter 4 (on induction and basic number theory) can be covered before Chapter 
3 (on sets) or omitted entirely. Chapter 6 (on equivalence relations) can be covered before 
the last five sections of Chapter 5 (on functions). Chapter 7 (on cardinality) requires ma- 
terial from Chapter 5 on bijections, but it does not rely heavily on Chapter 6. Finally, the 
optional Chapter 8 (axiomatic development of the real numbers) could be covered anytime 
after Chapter 2, with minor adjustments to avoid explicit mention of functions and rela- 
tions. However, Chapter 8 is more challenging than it may appear at first glance. We are 
all so familiar with basic arithmetic and algebraic facts about real numbers that it requires 
considerable intellectual discipline to deduce these facts from the axioms without acciden- 
tally using a property not yet proved. Nevertheless, it is rewarding and instructive (albeit 
somewhat tedious) to work through this logical development of R if time permits. 


Book’s Approach to Key Topics 


This book adopts a methodical, detailed, and highly structured approach to teaching proof 
techniques and related mathematical topics. We start with basic logical building blocks and 
gradually assemble these ingredients to build more complex concepts. To give you a flavor 
of the teaching philosophy used here, the next few paragraphs describe my approach to 
explaining four key topics: proof-writing, functions, multiple quantifiers, and induction. 


Skills for Writing Proofs 


Like any other complex task, the process of writing a proof requires the synthesis of many 
small atomic skills. Every good proofs textbook develops the fundamental skill of breaking 
down a statement to be proved into its individual logical constituents, each of which con- 
tributes certain structure to the proof. For example, to begin a direct proof of a conditional 
statement “If P, then Q,” we write: “Assume P is true; we must prove Q is true.” I ex- 
plain this particular skill in great detail in this text, introducing explicit proof templates for 
dealing with each of the logical operators. 

But there are other equally crucial skills in proof-writing: memorizing and expanding 
definitions; forming useful denials of complex statements; identifying the logical status of 
each statement and variable in a proof via appropriate status words; using known universal 
and existential statements in the correct way; memorizing and using previously proved 
theorems; and so on. I cover each of these skills on its own, in meticulous detail, before 
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assembling the skills to build increasingly complex proofs. Remarkably, this reduces the 
task of writing many basic proofs into an almost completely automatic process. It is very 
rewarding to see students gain confidence and ability as they master the basic skills one at 
a time and thereby develop proficiency in proof-writing. 

Here is an example to make the preceding ideas concrete. Consider a typical practice 
problem for beginning proof-writers: prove that for all integers x, if x is odd, then x +5 is 
even. In the proof below, I have annotated each line with the basic skill needed to produce 
that line. 


Line in Proof Skill Needed 

1. Let xo be a fixed, arbitrary integer. Prove V statement using generic element. 
2. Assume zo is odd; prove zo + 5 is even. Prove an IF statement by direct proof. 
3. We assumed there is k € Z with rp = 2k + 1. Expand a memorized definition. 

4. We will prove there is m € Z with zo +5 = 2m. Expand a memorized definition. 

5. Doing algebra on the assumption gives: Use logical status words. 

6. ao +5=(2k4+1)4+5 = 2k +6 = 2(k +3). Do basic algebraic manipulations. 

7. Choose m = k + 3, so £o + 5 = 2m holds. Prove J statement by giving an example. 
8. Note m is in Z, being the sum of two integers. Verify a variable is in the required set. 


Virtually every line in this proof is generated automatically using memorized skills; only 
the manipulation in line 6 requires a bit of creativity to produce the multiple of 2. Now, 
while many texts present a proof like this one, we seldom see a careful explanation of how the 
proof uses an assumed existential statement (line 3) to prove another existential statement 
(line 4) by constructing an example (lines 6 and 7) depending on the variable k in the 
assumption. This explanation may seem unnecessary in such a simple setting. But it is a 
crucial ingredient in understanding harder proofs in advanced calculus involving limits and 
continuity. There we frequently need to use an assumed multiply-quantified IF-statement 
to prove another multiply-quantified IF-statement. These proofs become much easier for 
students if they have already practiced the skill of using one quantified statement to prove 
another quantified statement in more elementary cases. 

Similarly, there is not always enough prior coverage of the skill of memorizing and 
expanding definitions (needed to generate lines 3 and 4). This may seem to be a minor 
point, but it is in fact essential. Before writing this proof, students must have memorized 
the definition stating that “x is even” means “there exists k € Z with x = 2k.” But to 
generate line 4 from this definition, k must be replaced by a new variable m (since k was 
already given a different meaning in line 3), and x must be replaced by the expression xo +5. 
I devote many pages to in-depth coverage of these separate issues, before integrating these 
skills into full proofs starting in Section 2.2. 


Functions 


A key topic in a proofs course is the rigorous definition of a function. A function is often 
defined to be a set of ordered pairs no two of which have the same first component. This 
definition is logically acceptable, but it causes difficulties later when studying concepts 
involving the codomain (set of possible outputs) for a function. Since the codomain cannot 
be deduced from the set of ordered pairs, great care is needed when talking about concepts 
that depend on the codomain (like surjectivity or the existence of a two-sided inverse). 
Furthermore, students accustomed to using the function notation y = f(a) find the ordered 
pair notation (x,y) € f jarring and unpalatable. My approach includes the domain and 
codomain as part of the technical definition of a function; the set of ordered pairs by itself 
is called the graph of the function. This terminology better reflects the way most of us 
conceptualize functions and their graphs. The formal definition in Section 5.4 is preceded 
by carefully chosen examples (involving arrow diagrams, graphs in the Cartesian plane, and 
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formulas) to motivate and explain the key elements of the technical definition. We introduce 
the standard function notations y = f(x) and f : X — Y without delay, so students do not 
get bogged down with ordered triples and ordered pairs. Then we describe exactly what 
must be checked when a new function is introduced: single-valuedness and the fact that 
every x in the domain X has an associated output in the claimed codomain Y. We conclude 
with examples of formulas that do or do not give well-defined functions. 


Multiple Quantifiers 


A hallmark of this book is its extremely careful and explicit treatment of logical quantifiers: 
V (“for all”) and 4 (“there exists”). The placement and relative ordering of these quantifiers 
has a big impact on the meaning of a logical statement. For example, the true statement 
Vr € Z, Jy € Z,y > x (“for every integer x there is a larger integer y”) asserts something 
very different from the false statement Jy € Z,Vx € Z,y > x (“there exists an integer y 
larger than every integer x”). However, these doubly quantified examples do not reveal the 
full complexity of statements with three or more nested quantifiers. Such statements are 
quite common in advanced calculus, as mentioned earlier. 

I give a very detailed explanation of multiple quantifiers in Sections 2.6 and 2.7. After 
examining many statements containing two quantifiers, I introduce more complicated state- 
ments with as many as six quantifiers, focusing on the structural outline of proofs of such 
statements. Using these big examples is the best way to explain the main point: an existen- 
tially quantified variable may only depend on quantified variables preceding it in the given 
statement. Other examples examine disproofs of multiply quantified statements, where the 
proof-writer must first form a useful denial of the given statement (which interchanges ex- 
istential and universal quantifiers). Many exercises develop these themes using important 
definitions from advanced calculus such as continuity, uniform continuity, convergence of 
sequences, and least upper bounds. 


Induction 


Another vital topic in a proofs course is mathematical induction. Induction proofs are needed 
when working with recursively defined entities such as summations, factorials, powers, and 
sequences specified by a recursive formula. I discuss recursive definitions immediately before 
induction, and I carefully draw attention to the steps in an induction proof that rely on these 
definitions. Many expositions of induction do not make this connection explicit, causing 
some students to stumble at the point in the proof requiring the expansion of a recursive 
definition (for example, replacing a sum Pyar tp by [Xp] Le] + n41). 

Induction proofs are often formulated in terms of inductive sets: sets containing 1 that 
are closed under adding 1. Students are told to prove a statement Vn € Zso,P(n) by 
forming the set S = {n € Zso : P(n) is true} and checking that S is inductive. This 
extra layer of translation confuses many students and is not necessary. Inductive sets do 
serve an important technical purpose: they provide a rigorous construction of the set of 
natural numbers as the intersection of all inductive subsets of R. I discuss this advanced 
topic in the optional final chapter on real numbers (see Section 8.3), but I avoid mentioning 
inductive sets in the initial treatment of induction. Instead, induction proofs are based on 
the Induction Axiom, which says that the statements P(1) and Vn € Zso, P(n) > P(n+1) 
suffice to prove Yn € Z>0o, P(n). This axiom is carefully motivated both with the visual 
metaphor of a chain of falling dominos and a more formal comparison to previously discussed 
logical inference rules. 
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Additional Pedagogical Features 


(a) Section Summaries and Global Reviews. Every section ends with a concise recap of the 
key points just covered. Each major part of the text (logic and proofs; sets and integers; 
relations, functions, and cardinality) ends with a global review summarizing the material 
covered in that part. These reviews assemble many definitions, theorem statements, and 
proof techniques in one place, facilitating memorization and mastery of this vast amount of 
information. 


(b) Avoiding Logical Jargon. This text avoids ponderous terminology from classical logic 
(such as conjunction, disjunction, modus ponens, modus tollens, modus tollendo ponens, 
hypothetical syllogism, constructive dilemma, universal instantiation, and existential in- 
stantiation). I use only those terms from logic that are essential for mathematical work 
(such as tautology, converse, contrapositive, and quantifier). My exposition replaces anti- 
quated Latin phrases like “modus ponens” by more memorable English names such as “the 
Inference Rule for IF.” Similarly, I refer to the hypothesis P and the conclusion Q of the 
IF-statement P = Q, rather than calling P the antecedent and Q the consequent of this 
statement. 


(c) Finding Useful Denials. This is one of the most crucial skills students learn in a proofs 
course. Every good textbook states the basic denial rules, but students do not always realize 
(and texts do not always emphasize) that the rules must be applied recursively to find a 
denial of a complex statement. I describe this recursive process explicitly in Section 1.5 
(see especially the table on page 35). Section 1.6 reinforces this key skill with many solved 
sample problems and exercises. 


(d) Annotated Proofs. Advanced mathematics texts often consist of a series of definitions, 
theorems, and proofs with little explanation given for how the author found the proofs. This 
text is filled with explicit annotations showing the reader how we are generating the lines of 
a proof, why we are proceeding in a certain way, and what the common pitfalls are. These 
annotations are clearly delineated from the official proof by enclosing them in square brack- 
ets. Many sample proofs are followed by commentary discussing important logical points 
revealed by the proof. 


(e) Disproofs Contrasted with Proofs by Contradiction. A very common student mistake is 
to confuse the disproof of a false statement P with a proof by contradiction of a true state- 
ment Q. This mistake occurs because of inattention to logical status words: the disproof of 
P begins with the goal of proving a denial of P, whereas a proof of Q by contradiction begins 
by assuming the denial of Q. We explicitly warn readers about this issue in Remark 2.60. 


(£) Set Definitions. New sets and set operations are often defined using set-builder notation. 
For example, the union of sets S and T is defined by writing SUT = {@:2€ S or x€ TH. 
This book presents these definitions in a format more closely matching how they arise 
in proofs, by explicitly stating what membership in the new set means. For instance, my 
definition of set union says that for all sets S and T and all objects x, the defined term 
x E€ SUT |can be replaced by the definition text |x € S or x € T | at any point in a proof. 
This is exactly what the previous definition means, of course, but the extra layer of trans- 
lation inherent in the set-builder notation causes trouble for many beginning students. 


(g) Careful Organization of Optional Material. Advanced material and additional topics 
appear in clearly labeled optional sections. This organization provides maximum flexibility 
to instructors who want to supplement the material in the standard core, while signaling 
to readers what material may be safely skipped. 


Preface Xv 


Exercises, Errata, and Feedback 


The book contains more than 1000 exercises of varying scope and difficulty, which may be 
assigned as graded homework or used for self-study or review. I welcome your feedback about 
any aspect of this book, most particularly corrections of any errors that may be lurking in 
the following pages. Please send such communications to me by email at nloehr@vt.edu. I 
will post errata and other pertinent information on the book’s website. 
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to Advanced Mathematics by Smith, Eggen, and St. Andre. My debt to this excellent text 
will be evident to anyone familiar with it. My development as a mathematician and a writer 
has also been deeply influenced by the superb works of James Munkres, Joseph Rotman, 
J. Donald Monk, and the other authors listed in the Suggestions for Further Reading (see 
page 383). I thank all the editorial staff at CRC Press, especially Bob Ross and Jose Soto, 
and the anonymous reviewers whose comments greatly improved the quality of my original 
manuscript. 

I am grateful to many students, colleagues, friends, and family members who supported 
me during the preparation of this book. I especially thank my father Frank Loehr, my 
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express how much I learned about teaching proofs from my students. Thank you all! 
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Nicholas A. Loehr 
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Logic 


1.1 Propositions, Logical Connectives, and Truth Tables 


Many people despise mathematics, believing it to be nothing more than a confusing jumble 
of arcane formulas and mind-numbing computations. This depressing view of the subject is 
understandable, when we consider how math is presented in grade school and many calculus 
classes. But in truth, mathematics is a beautiful, intricately structured tower of knowledge 
built up from a small collection of basic statements (called azioms) using the laws of logic. 
In this book, we shall study the foundation of this tower, as shown here: 


cardinality 
functions 
relations 
integers 
sets 
proofs 
logic 


Propositions 


We begin with propositional logic, which studies how the truth of a complex statement is 
determined by the truth or falsehood of its parts. 


1.1. Definition: Propositions. A proposition is a statement that is either true or false, 
but not both. 


Many things we say are not propositions, as seen in the next example. 


1.2. Example. Which of these statements are propositions? 

a) 7 is positive. 

b) 1+1=7. 

c) Memorize all definitions. 

d) Okra tastes great. 

e) Is it raining? 

f) This sentence is false. 

(g) Paris is a city and 2 + 2 is not 4, or Paris is not a city and 2 + 2 is 4. 

Solution. Statement (a) is a true proposition. Statement (b) is a false proposition. Com- 
mands, opinions, and questions do not have a truth value, so statements (c) through (e) are 
not propositions. Statement (f) is an example of a paradoz: if you assume this statement is 
true, then the statement itself asserts that it is false. If you instead assume the statement 
is false, then the statement is also true. Since propositions are not allowed to be both true 


( 
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and false, statement (f) is not a proposition. Finally, statement (g) is a false proposition, 
for reasons described below. 


Propositional Forms 


Statement (g) in the last example is a complex proposition built up from two shorter propo- 
sitions using logical connective words such as AND, NOT, OR. The two shorter propositions 
are “Paris is a city” and “2 + 2 is 4.” Let us abbreviate the proposition “Paris is a city” 
by the letter P, and let us abbreviate the proposition “2 + 2 is 4” by the letter Q. Then 
statement (g) has the form 


(P AND (NOT Q)) OR ((NOT P) AND Q). 


We can create an even shorter expression to represent the logical form of statement (g) 
by introducing special symbols for the logical connectives. Propositional logic uses the six 
symbols shown in the following table, whose meaning is discussed in detail below. 


Logical Symbol | English Translation 
~P P is not true. 

PAQ P and Q. 

PVQ P or Q (or both). 
PEQ P or Q, but not both. 
P=Q if P, then Q. 

PQ P if and only if Q. 


Using these symbols, we can describe the logical form of statement (g) by the expression 


(P A (~Q)) V (~P) AQ). (1.1) 


Any expression like this, which is built up by combining propositional variables (capital 
letters) using logical symbols, is called a propositional form. 


Definitions of NOT and AND 


In logic and mathematics, language is used in a very precise way that does not always 
coincide with how words are used in everyday conversation. In particular, before going 
further, we need to define the exact meaning of the logical connective words NOT, AND, 
OR, IF, etc. To do this, we give truth tables that show how to combine the truth values of 
propositions to obtain the truth value of a new proposition built from these using a logical 
connective. Here and below, italic capital letters such as P, Q, R are variables that represent 
propositions. The letters T and F stand for true and false, respectively. 


1.3. Definition of NOT. For any proposition P, the truth value of ~P (“not P”) is 
determined by the following table. 


P | ~P 
T| F 
F| T 


Remember this table by noting that ~P always has the opposite truth value as P. 


In our example above, where Q is the true proposition “2 + 2 is 4,” ~Q is the false 
proposition “2 + 2 is not 4.” If R is the false proposition “9 is negative,” then ~R is the 
true proposition “9 is not negative.” 


Logic 3 


The remaining logical connectives combine two propositions to produce a new proposi- 
tion. As seen in the next definition, we need a four-row truth table to list all possible truth 
values of the two propositions that we start with. 


1.4. Definition of AND. For any propositions P, Q, the truth value of P A Q 
(“P and Q”) is determined by the following table. 


P Q|PAQ 
i; T 
T F| F 
F T| F 
F F| F 


Remember this table by noting that PA Q is true only when both inputs P, Q are true; 
in all other cases, P A Q is false. 


In our running example, where P is the true proposition “Paris is a city” and Q is the 
true proposition “2+ 2 is 4,” note that P A (~Q) is false, because P is true and ~Q is false. 
Similarly, (~P) A Q is false, because ~P is false and Q is true. In this example, for any 
proposition R, we can deduce that (~P) A R must be false no matter what the truth value 
of R is, since ~P is already false. On the other hand, P A R is true for true propositions R, 
and false for false propositions R. 


Definitions of OR and XOR 


We now come to the mathematical definition of the word OR. In everyday English, the word 
OR can be used in two different ways. In the inclusive usage of this word, “P or Q” means 
that at least one of the propositions P, Q is true (possibly both). In the exclusive usage, “P 
or Q” means that exactly one of the propositions P, Q is true and the other one is false. Logic 
uses two different symbols for these two usages of the word OR: V stands for inclusive-OR, 
and © stands for exclusive-OR (which is also abbreviated XOR). Remember this convention: 
in mathematics, the English word OR always means inclusive-OR, as defined by the truth 
table below. If you want to use XOR in mathematical English, you must use a longer phrase 
such as “P or Q, but not both.” 


1.5. Definition of OR. For any propositions P, Q, the truth value of P V Q (“P or Q”) 
is determined by the following table. 


P Q|PVQ 
T T-T 
T F| T 
FTIT 
F F| F 


Remember this table by noting that P V Q is false only when both inputs P, Q are false; 
in all other cases, P V Q is true. To avoid confusing the symbols ^ and V, it may help to 
notice that ^ (the symbol for AND) resembles the diagonal strokes in the capital letter A. 


To complete our analysis of statement (g) above, recall in this example that (PA(~Q)) is 
false and ((~P) AQ) is false. So the overall statement is false. On the other hand, PV (~Q), 
(~P) V Q, and P V Q all stand for true propositions in this example. 


1.6. Definition of XOR. For any propositions P, Q, the truth value of PQ 
(“P XOR Q”) is determined by the following table. 
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P_Q|POQ 
T T F 
T F T 
F T T 
F F F 


Remember this table by noting that P 6 Q is true when the input propositions P, Q 
have different truth values; P 6 Q is false when the input propositions P, Q have the same 
truth values. 


1.7. Example. Let P be the false proposition “1+1 = 3,” and let Q be the true proposition 

“2 is positive.” Which propositions below are true? 

(a) POQ; (b) P(Q); (c) (~P)EQ; (d) PVR; (0) PV(~Q); @) (~P)VQ. 
Solution. Using the truth tables for 6, V, and ~, we find that (a) is true, (b) is false, 

(c) is false, (d) is true, (e) is false, and (f) is true. The different answers for (c) and (f) 

illustrate the distinction between exclusive-OR and inclusive-OR. 


Truth Tables for Propositional Forms 


Given a complex propositional form built up from some variables using one or more logical 
connectives, we can determine the meaning of this form by making a truth table showing 
the truth value of the form for all possible combinations of truth values of the variables 
appearing in the form. We construct such a truth table step-by-step, making a column for 
each input variable and each smaller propositional form contained within the given form. 
We complete each column by using the defining truth tables (given above for ~, A, V, ®, and 
given in later sections for = and +) to combine the truth values in one or two previously 
completed columns. We illustrate this process in the following examples. 


1.8. Example: Truth Table for (P AQ) V (~P). This form has two input variables, P 
and Q, so we need a four-row truth table. We make columns for P, Q, ~P, P A Q, and 
(PA Q)V (~P), and fill them in as shown here: 


Note, in particular, that the final column is completed by comparing the truth values 
in each row of columns 3 and 4. In row 2 (and only row 2), the two input truth values are 
both false, so the output in this row is false. All other rows must have output that is true, 
by definition of OR. 


1.9. Example: Truth Table for PA (QV (~P)). This form looks almost identical to the 
one in the last example, but the parentheses are in a different position. This changes the 
structure of the truth table, as shown here: 


P Q|6P QV(~P) PA(QV(~P)) 


T T| EF T T 
T F| F F F 
F TIT T F 
F FIT T F 


We fill in column 4 by looking for two Fs in columns 2 and 3 (which happens only in 
row 2), placing an F in that row, and filling the rest of column 4 with Ts. Then we fill in 
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column 5 by looking for two Ts in columns 1 and 4 (which happens in row 1 only), placing 
a T in that row, and filling the rest of column 5 with Fs. 


In these examples, observe that the truth tables for (PA Q)V (~P) and PA (QV (~P)) 
are not identical, since the outputs in rows 3 and 4 differ in the two tables. This shows 
that these two propositional forms are not logically interchangeable with one another in 
all situations. More briefly, we say that the two forms are not logically equivalent. On the 
other hand, if two propositional forms A and B have truth tables whose outputs agree in 
every row, we say that A and B are logically equivalent and write A = B. (We study logical 
equivalence in detail in the next few sections.) Notice, for instance, that P A (Q V (~P)) is 
logically equivalent to P ^ Q, since column 5 in the previous example agrees with the truth 
table for P A Q in all four rows. So we could write P A (Q V (~P)) = P AQ, enabling us 
to replace the complicated propositional form on the left by the shorter form P A Q on the 
right. 


1.10. Example: Translation of XOR. We have noted that P 6 Q can be translated 
as “P or Q, but not both.” This English phrase is built up using the logical connectives 
OR, NOT, BUT, and BOTH (the last two words have the same meaning as AND). So 
a more literal encoding of the phrase “P or Q, but not both” as a propositional form is 
(PVQ)A(~(PAQ)). The following truth table verifies that (PVQ)A(~(PAQ)) = PEQ. 
This logical equivalence justifies the use of this phrase as a translation of XOR. 


P Q|P8eQ PVQ PAQ APAQ) (PVQ)A((PAQ)) 
F F 


T T F T T 

T F T T F T T 
F T T T F T T 
F F F F F T F 


Note that we combine columns 4 and 6 to complete column 7 using the definition of AND. 
The claimed logical equivalence follows from the complete agreement of columns 3 and 7 in 
every row. In contrast, since columns 3 and 4 disagree in row 1, we see that (PEQ) 4 (PVQ). 
In other words, inclusive-OR and exclusive-OR are not logically equivalent. 


Formal Definition of Propositional Forms (Optional) 


In the main text, we informally defined a propositional form to be any expression built 
up by combining propositional variables using logical symbols. To give a precise, rigorous 
definition of propositional forms, we need a recursive definition (definitions of this type 
are studied in detail later). Specifically, propositional forms are defined recursively via the 
following rules: 


b) If A is any propositional form, then (~A) is a propositional form. 

c) If A and B are any propositional forms, then (A A B) is a propositional form. 
d) If A and B are any propositional forms, then (A V B) is a propositional form. 
e) 
) 
8 


For example, rule (a) shows that P is a propositional form, as is Q. Then rule (b) shows 
that (~P) and (~Q) are propositional forms. By rule (c), (P A (~Q)) and ((~P) A Q) are 
propositional forms. By rule (d), 


((P A (~Q)) v (~P) ^A Q)) (1.2) 
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is a propositional form, which is essentially the form studied in (1.1). 

However, to be absolutely precise, the expression in (1.1) is not a propositional form. 
This is because (1.1) is missing the outermost pair of parentheses, which are required to be 
present when using rules (b) through (g). In practice, we often drop required parentheses 
to create abbreviated versions of propositional forms that are easier to read. For instance, 
we seldom write the outermost pair of parentheses. Logical equivalences called associativity 
rules (discussed in the next section) allow us to drop internal parentheses in an expression 
like ((PVQ)V R) or (PA(QA R)). We can erase even more parentheses using the precedence 
conventions that ~ has highest precedence, followed by A, followed by V. For example, with 
these conventions, we could drop all parentheses from (1.2) and abbreviate this form as 
PA~QV~PAQ. However, as shown in Examples 1.8 and 1.9, the placement of parentheses 
often affects the logical meaning of a propositional form (which is one reason why the official 
definition requires them). In this text, we often write parentheses that could be deleted by 
the precedence conventions, to reduce the chance of confusion. 


1.11. Remark: Terminology for Propositional Forms. Let A and B be any propo- 
sitional forms. In some logic texts, (~A) is called the negation of A; (A A B) is called 
the conjunction of A and B; (AV B) is called the disjunction of A and B; (A => B) is 
called an implication or conditional with hypothesis (or antecedent) A and conclusion (or 
consequent) B; (A & B) is called the biconditional of A and B. We mostly avoid using 
these technical terms in this text, except for the term negation. Later, we study methods 
of finding propositional forms logically equivalent to (~A), which are called denials of A. 


Section Summary 


1. Memorize the definitions of the logical connectives, as summarized in the following 
truth table. 


NOT P PANDQ PORQ PxXORQ 


P Q| ~P PAQ PVQ PQ 
T T| F T T F 
T F| F F T T 
FT| T F T T 
FF| T F F F 


Note that NOT flips the truth value of the input; AND outputs true exactly when 
both inputs are true; OR outputs false exactly when both inputs are false; and 
XOR outputs true exactly when the two inputs differ. 


2. Remember that in mathematics, “P or Q” always means P V Q (inclusive-OR). 
To translate P @ Q (exclusive-OR) into English, you must use an explicit phrase 
such as “P or Q, but not both.” 


3. Two propositional forms A and B are logically equivalent, denoted A = B, when 
these forms have truth tables whose outputs agree in all rows of the truth table. If 
there is even a single row of disagreement, the forms are not logically equivalent. 
For instance, PVQ4#P@Q. 


Exercises 


1. Is each statement true, false, or not a proposition? Explain. 
(a) 1+ 1 = 2 or London is a city. (b) Math is fun. (c) France is a country, and 
Berlin is an ocean. (d) 0 < lor 1 < 0. (e) 0 < lor1 <0, but not both. (f) Chopin 
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was the greatest classical composer. (g) 2 is not even’, or 2 is positive. (h) 15 is 
odd or positive, but not both. 


2. Is each statement true, false, or not a proposition? Explain. 

(a) It is not the case that 2 is even or 2 is positive. (b) The following statement 
is a true proposition. (c) i x? dx = 1/3 and 2° = 32. (d) When do fish sleep? 
(e) The following statement is a true proposition. (f) The previous statement is 
a false proposition. (g) I am tired. 

3. Construct truth tables for the following propositional forms. 

(a) (~P)AQ (b) (PAQ)V (HP) A(~Q)) (c) (PV (~Q)) © (~Q) V P) 
(dq) (PAQ)VR (e)PA(QVR) (f) PV(QER) 

4. (a) Is ~(P V Q) logically equivalent to (~P) V (~Q)? Explain with a truth table. 
(b) Give English translations of the two propositional forms in (a). 

5. Assume P is a true proposition, Q is a false proposition, and R is an arbitrary 
proposition. Say as much as you can about the truth or falsehood of each propo- 
sition below. 

(a) PA(~Q) (b) (~P)VQ (ce) PEQ (d) PS(~Q) (e) RVP (QVR 
(8e) QPR (h) RA(~R) (i) RV(~R) G) ROR (k) R@(~R). 

6. (a) Make a truth table for the propositional form (PA(~Q)) V((~P) AQ). Be sure 
to show the columns for all intermediate forms. (b) Find a short propositional 
form that is logically equivalent to the form in (a). (c) Use (b) to write an English 
sentence with the same logical meaning as statement (g) in Example 1.2. 

7. (a) Make a truth table for PA((~Q)VP). (b) Make a truth table for (PA(~Q))VP. 
(c) Are the propositional forms in (a) and (b) logically equivalent? Why? 

(d) Find a short propositional form that is logically equivalent to the form in (a). 
8. Which propositional forms (if any) are logically equivalent? Justify your answer 
with truth tables. (a) P (b) ~P (c) PVP (d) PAP (e) PẸP (£f) (PAQ)VP. 
9. Which propositional forms (if any) are logically equivalent? Explain. 
a) PAQ (b) QAP (c) (PAQ)AP (d) (PQ). 

10. Which propositional forms (if any) are logically equivalent? Explain. 
a) (PAQ) (b) (~P) A^ (~Q) (c) (HP) AQ) V (PA (~Q)) 
d) PS (~Q) (e) PV (~Q) (£) (PV (~Q)) A (~P) V Q). 

11. Which propositional forms (if any) are logically equivalent? Explain. 
a) PV(QVR) (b) P@(QVR) (c) PV(QSR) (dA) PO(QGR) 
e) (PVQ)VR (f) PeQ)VR (g) (PVQ)OR (h) (PEQ)OR. 

12. Let xo be a fixed integer. Let P be the proposition “zo < 8,” let Q be “0 < xo,” 
and let R be “zo = 0.” Encode each statement as a propositional form involving 
P,Q, and R. 

(a) zo is strictly positive. (b) xo > 8. (c) zo is less than zero. 
(d) zo is nonnegative. (e)0<a <8 (f)0<a <8. (g) |ao—4| > 4. 

13. Let P be the statement “1+1 = 2,” let Q be “0 = 1,” and let R be “71 is prime.” 
Convert each propositional form into an English sentence without parentheses. Be 
sure that the sentence has the precise logical structure encoded by the form. 

(a) (PAQ)VR (b) PA(QVR) (APAQ) (d) (~P)AQ 
(e) (PAQ)AR (f) PA(QQAR) (8) (WP) A(~Q)) 9 (~R) 
1Some exercises and examples in Chapter 1 assume familiarity with even integers ...,—4,—2,0,2,4,... 


and odd integers ...,—5,—3,—1,1,3,5,.... Formal definitions of even and odd appear in §2.1. 


14. 


15. 


16. 


17. 


18. 


19. 
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Explain why each statement is not a proposition. 

(a) The last train leaves in five minutes. (b) We should raise the minimum wage. 
(c) Will you marry me? (d) My father’s name is Frank. (e) She is smarter than 
he is. (f) Do your homework. (g) It is cold here. (h) This statement is true. 


Find propositional forms using only the connectives V, A, and ~ that have the 
truth tables shown below. Try to use as few connectives as possible. 


my ay ey H a H 
2 


HEHHE 


gs izsiz fes i ia Har far BAY 
5 jH Hmm Ho 
MHA aH H 
HAHAHAHA YS 


AAPA yyy 


(a) Find a simple verbal description characterizing when ((A®B)@(C@D)) GE 
is true. (b) Does your answer to (a) change if we rearrange parentheses in the 
expression? What if we reorder A, B, C, D, and E? 

Find a propositional form involving variables P, Q, R that is true precisely when 
exactly one of P, Q, R is true. Try to use as few logical connectives as possible. 


Find a propositional form involving variables P, Q, R that is true when at least 
two of the inputs are true, and false otherwise. Try to use as few logical connectives 
as possible. 


(a) How many truth tables are possible for propositional forms containing two 
variables P and Q? (b) For each possible truth table in (a), find a short proposi- 
tional form having that truth table. 
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1.2 Logical Equivalences and IF-Statements 


You have probably been using the word IF for most of your life, but do you really know 
what this word means? The official answer, given in this section, may surprise you. We also 
look more closely at logical equivalence, developing transformation rules for propositional 
forms analogous to some basic laws of algebra. Everything is proved using our favorite proof 
technique, truth tables. 


Logical Equivalence 


We first recall the definition of logically equivalent propositional forms from the last section. 


1.12. Definition: Logically Equivalent Propositional Forms. Two propositional 
forms A and B are logically equivalent when the truth tables for A and 6 have outputs 


that agree in every row. We write | A = B | when A and B are logically equivalent; we write 


A # B| when A and B are not logically equivalent. 


Our first theorem lists some logical equivalences that can often be used to replace com- 
plicated propositional forms by shorter ones. 


1.13. Theorem on Logical Equivalence. For all propositional forms P, Q, and R, the 

following logical equivalences hold: 

(a) Commutative Laws: PAQ = QAP, PVQ=QVP,andP@Q=QEP. 

(b) Associative Laws: PA (QAR) = (PAQ)AR, PV(QVR) = (PVQ)VR, 
and PE (QSR) = (PE Q)ƏR. 

(c) Distributive Laws: PA (QV R) = (PAQ) V (PAR), PV (QAR) = (PVQ)A(PVR), 
and PA(Q@ R) = (PAQ)G(PAR). 

(d) Idempotent Laws: PA P = P and PVP = P. 

(e) Absorption Laws: PA (PV Q) = P and PV (PAQ) =P. 

(£) Negation Laws: ~(~P) = P, (PAQ) = (~P) vV (~Q), and ~(PVQ) = (~P) A (~Q). 


The names of the first three laws indicate the analogy between these logical equivalences 
and certain algebraic properties of real numbers. For example, the commutative laws for 
propositional forms resemble the commutative laws £ + y = y + x and z- y = y- x for real 
numbers x, y; similarly, observe the resemblance between the distributive laws listed above 
and the algebraic distributive law x- (y + z) = (x - y) + (a: z). 


All parts of this theorem are proved by constructing truth tables for the propositional 
forms on each side of each equivalence, and verifying that the truth tables agree in every 
row. We illustrate the technique for a few parts of the theorem, asking you to prove the 
other parts in the exercises. 


1.14. Proof of the Idempotent Laws. The truth table is shown here: 


P|P PAP PVP 
T/T T T 
F|F F F 


We list the column for P twice to make it easier to compute P A P and P V P. We 
compute the last two columns from the first two columns using the definition of AND and 
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OR, respectively. Since column 3 and column 2 agree in all rows, P A P = P. Since column 
4 and column 2 agree in all rows, PV P= P. 


1.15. Proof of ~(P V Q) = (~P) A (~Q). For this equivalence, we need a four-row truth 
table: 


P_Q|PVQ_~(PVQ|~P_~@ (PAO) 
T T T F F F F 
T F T F F AR F 
F T T F T F F 
F F F T T T T 


Columns 4 and 7 agree in every row, so the logical equivalence is proved. This equivalence 
and its companion ~(P A Q) = (~P) V (~Q) are called de Morgan’s laws. In words, the 
law we just proved says that the negation of an OR-statement “P or Q” is equivalent to 
the statement “(not P) and (not Q),” whereas the dual law tells us that “not (P and Q)” 
is logically equivalent to “(not P) or (not Q).” For a specific example, the statement “it is 
false that 10 is prime or 10 is odd” may be simplified to the statement “10 is not prime and 
10 is not odd” without changing the truth value (both statements are true). Similarly, the 
statement that “John has brown hair AND blue eyes” is false precisely when John’s hair is 
not brown OR John’s eyes are not blue. In 81.5, we will say a lot more about simplifying 
statements that begin with NOT. 


1.16. Proof of Associativity of XOR. This law has three input variables, so we need 
a truth table with eight rows, as shown below. One quick way to create the eight possible 
input combinations is to write 4 Ts followed by 4 Fs in the column for P; then write 2 Ts, 
2 Fs, 2 Ts, 2 Fs in the column for Q; then write alternating Ts and Fs in the column for 
R. This pattern extends to truth tables with even more variables. In general, if there are n 
distinct input variables that stand for propositions, the truth table has 2” rows. 


QSR PSE(QSR)|PSQ (PEQ)ER 


AeA yaya 
z5 a Ha HA Ho 
=A HHH 
MOA aay 
MHAmeagy 
yy 
Maya H 


The quickest way to fill in this truth table is to look for disagreements between the 
inputs of each XOR (being sure to refer to the correct columns), writing Ts in those rows, 
and writing Fs in all other rows. Columns 5 and 7 agree in all eight rows, so the associative 
law for @ is proved. The other associative laws and distributive laws are proved by similar 
eight-row truth tables. 


What IF Means 


We are now ready to define the precise logical meaning of the word IF. Be warned that the 
usage of IF in everyday conversational English often does not match the definition we are 
about to give. Nevertheless, this definition is universally used in logic, mathematics, and all 
technical communication. When discussing the statement P => Q, which is read “if P then 
Q,” we call P the hypothesis and Q the conclusion of the IF-statement. 
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1.17. Definition of IF. For any propositions P and Q, the truth value of P > Q 
(“if P then Q”) is determined by the following table. 


P Q|P3Q 
T T T 
T F F 
F T T 
F F T 


Remember this table by noting that P > Q is false only when the hypothesis P is true 
and the conclusion Q is false; in all other situations, the IF-statement is true. In particular, 
when P is false, the IF-statement P > Q is automatically true, whether Q is true or false; 
this fact often confuses beginners. 


1.18. Example. True or false? 

(a) f1+1=3, then 1+1=5. 

(b) If 0 £0, then 0 = 0. 

(c) If 3 > 2, then London is the capital of England. 

(d) If 9 is odd, then 9 is even. 

Solution. Statement (a) has the form P > Q, where P is the false statement “1+ 1 = 3,” 
and Q is the false statement “1+1 = 5.” By definition of IF, statement (a) is true, no matter 
how unintuitive this statement may sound. Statement (b) has the form (~Q) > Q, where 
Q is the true proposition “0 = 0.” Since the hypothesis ~Q is false, we can see that (b) is 
true without even reading the statement following THEN. Statement (c) is true (according 
to row 1 of the truth table), even though the two parts of the statement have no causal 
relation to each other. Finally, statement (d) is false because “9 is odd” is true, but “9 is 
even” is false. 

These examples show that we must not rely upon an intuitive impression of the content 
of an IF-statement to determine its truth value; we must instead combine the truth values 
of the inputs according to the defining truth table. As shown in (c), there need not be any 
cause-and-effect relationship between the hypothesis and conclusion of an IF-statement. 


It is fair to ask why IF is defined in such an apparently unnatural way. Most people agree 
that “if true then true” should be considered true, whereas “if true then false” should be 
considered false. Why, though, is “if false then anything” defined to be true? One possible 
answer draws an analogy between IF-statements, as they are used in logic, and IF-statements 
appearing in legal contracts or judicial proceedings. Suppose a painting company issues a 
contract to a homeowner containing the statement: “if you pay us $1000 by May 1, then we 
will paint your house by June 1.” Under what conditions would we say that the company 
has violated this contract? For example, suppose the company does not paint the house 
because it was not paid on time. Both clauses of the IF-statement are false, yet no one 
would say that the company violated the contract in this situation. Similarly, supposing 
that the money is paid on May 3 and the house is painted on time, the company has not 
violated the contract even though the first part of the IF-statement is false. 

We can give another, more mathematical motivation for the definition of IF by consider- 
ing the following statement: “for all real numbers z, if x > 2 then x? > 4.” (This statement 
uses variables and quantifiers, which are discussed later.) Most people would judge this 
statement to be true, on intuitive grounds, and this judgment is correct. But once you 
agree this statement should be true, you are forced to agree that it should be true after 
replacing each x in the statement by any specific real number. In particular, we see that 
each of these statements must be true: 


e “if 3 > 2 then 3? > 4” (illustrating row 1 of the truth table); 
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e “if —4 > 2 then (—4)? > 4” (illustrating row 3 of the truth table); 
e “if 0 > 2 then 0? > 4” (illustrating row 4 of the truth table). 


Thus the whole truth table for IF is forced upon us, granting that row 2 of the truth table 
ought to be false. 

In mathematics, a statement “if P then Q” is not making any claims about the truth 
of the conclusion Q in the event that the hypothesis P does not hold; so in this event, we 
should not declare the IF-statement to be false. But since propositions must have some 
truth value, we are forced to call the IF-statement true in such situations. 


Converse and Contrapositive 


Given an IF-statement P > Q, we can form two related IF-statements called the converse 
and contrapositive of the original statement. 


1.19. Definition: Converse and Contrapositive. For any propositional forms P and 
Q: the | converse of P > Q lis | Q = P |; the | contrapositive of P > Q lis | (~Q) > (~P) |. 


1.20. Example. The converse of “if 9 is negative, then 9 is odd” is “if 9 is odd, then 9 
is negative.” Observe that the first statement is true, whereas the converse is false. The 
contrapositive of “if 9 is negative, then 9 is odd” is “if 9 is not odd, then 9 is not negative,” 
which is true. 

The converse of “if 2 = 3, then 4 = 9” is “if 4 = 9, then 2 = 3.” Both the original 
statement and its converse are true. The contrapositive of “if 2 = 3, then 4 = 9” is “if 
4 #9, then 2 43,” which is also true. 


In both examples, the contrapositive had the same truth value as the original statement, 
but this was not always the case for the converse. The next theorem explains what happens 
in general. 


1.21. Theorem on IF. Let P and Q be distinct propositional variables. 
(a) Non-equivalence of Converse: P>Q 4 Q=>P. 


(b) Equivalence of Contrapositive: P= Q = (~Q) => (~P). 
(c) Elimination of IF: P> Q = (~P) VQ. 
(d) Denial of IF: {P> Q) = PA(-Q). 


Part (a) says that a general IF-statement is not logically equivalent to its converse. Part 
(b) says that a general IF-statement is logically equivalent to its contrapositive. Part (c) says 
that IF-statements are essentially special kinds of OR-statements in which the first input 
to the OR has been negated. We can use (c) to convert IF-statements to OR-statements 
and vice versa. Part (d) shows that the negation of an IF-statement is logically equivalent 
to an AND-statement. Parts (b), (c), and (d) hold for any propositional forms P and Q, 
not just individual letters. For instance, (A V B) > (C@D) = (~(AV B) v (CD) 
follows by replacing P by AV B and replacing Q by C @ D in part (c). Memorize all parts 
of this theorem, as they will be used constantly throughout our study of logic and proofs. 

To prove the theorem, consider the following truth table: 


Q|P=Q e. = (~Q) > (~P) | &P)VQ 


=> 
T 
F 
T 
T 


Ay H H Y 
md 
HA 


T 
F 
T 
T 


HHIH 
HH 
HH 
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Column 3 does not match column 4 in rows 2 and 3, proving (a). Column 3 matches 
column 7 in all rows, proving (b). Column 3 also matches column 8 in all rows, proving 
(c). We can prove (d) by adding a few more columns to this truth table. Alternatively, (d) 
follows from (c) and the Negation Laws, since 


MP >=Q) = ~((~P)VQ) = (~P) AQ) = PAQ). (1.3) 


Five Remarks on Logical Equivalence (Optional) 


(1) The Theorem on Logical Equivalence shows that many of the laws of logic have an 
algebraic character analogous to algebraic laws for arithmetical operations on real numbers. 
To make this analogy more explicit, we introduce the convention of representing true (T) 
by 1 and false (F) by 0. The symbols 0 and 1 are called bits. In this notation, the defining 
truth tables for the six logical operators (including IFF, discussed in the next section) look 
like this: 


P Q|~P PAQ PVQ PEQ P=Q PeR 
0 Of 1 0 0 0 1 1 
0 1 1 0 1 1 1 0 
1 0| 0 0 1 1 0 0 
1 1 0 1 1 0 1 1 


If we think of P and Q as bits (the actual integers 0 and 1, rather than propositions), 
we see that ~P = 1—P; PAQ = P-Q = min(P, Q), the product or minimum of P and Q; 
P V Q = max(P, Q), the maximum of P and Q; and P @ Q is the mod-2 sum of P and Q. 
(We discuss modular addition in detail later.) We can also think of 6, >, and & as relations 
comparing their two inputs. In this interpretation, P 6 Q is true precisely when P 4 Q, so 
® models non-equality of bits; P = Q is true precisely when P = Q, so + models equality 
of bits; and P = Q is true precisely when P < Q, so = models ordering of bits. Now the 
various identities in the Theorem on Logical Equivalences translate into identities for these 
algebraic operations on bits. For instance, supposing we already know that «(yz) = (xy)z 
for all integers x,y,z, we can deduce the associativity law P A (QA R) = (PA Q)AR for 
propositional forms. 

(2) Logical equivalence of propositional forms (denoted =) is similar to, but not the 
same as, equality of forms (denoted =). Thinking of propositional forms A and B as strings 
of symbols, A = B means that A has exactly the same symbols as $, in the same order. For 
example, PV Q is not equal to QV P since the first symbols do not match. Yet, as we know, 
PVQ=QVP since the two truth tables agree. For most purposes of logic, two logically 
equivalent forms are interchangeable with each other (since their truth values must agree), 
even though such forms are often not equal as strings. 

(3) Although logical equivalence of propositional forms (denoted =) is not the same 
concept as logical equality (denoted =), these two concepts share many common properties. 
For example, given any objects x, y, z whatsoever, the following facts are true about equality: 


e=«; if*=ythny=a2; ifx= yand y= z then z = z. 


These facts are called reflexivity, symmetry, and transitivity of equality. Three analogous 
facts also hold for logical equivalence: given any propositional forms A, B, C, 


A=A; if A=BthnB=A; if A=B and B =C then A=C. (1.4) 


These facts follow from the corresponding facts for equality, applied to the rows of the truth 
tables. For instance, if the output in every row of A’s truth table equals the corresponding 


14 An Introduction to Mathematical Proofs 


output of 6’s truth table, then the output in every row of 6’s truth table equals the corre- 
sponding output of A’s truth table, which is why the second fact is true. The text implicitly 
uses the reflexivity, symmetry, and transitivity of logical equivalence in many places. For 
instance, the derivation (1.3) has the form A = B = C =D, which is really an abbreviation 
for “A= B and B = C and C =D.” Using transitivity once, we see that A= C. Using tran- 
sitivity again, we see that A = D, which is the required conclusion. Similarly, symmetry of 
= is needed whenever we want to use a known logical equivalence in reverse, e.g., to replace 
(PAQ)V (PAR) by PA (QV R) by invoking the Distributive Law. We will have much 
more to say about the properties of reflexivity, symmetry, and transitivity in Chapter 6, 
when we study equivalence relations. 

(4) The logical equivalences in this section’s theorems apply to all propositional forms 
P, Q, R, which means that we can replace each letter P, Q, R (wherever it appears in an 
identity) by a longer propositional form. For instance, replacing P by A > B and Q by BOC 
in the second negation law, we see that ~((A > B)A(B@C)) = (~(A=> B))V(~“(BO@C)). 
In contrast, consider the non-logical equivalence asserted in part (a) of the Theorem on 
IF. In that theorem, P and Q are distinct propositional variables (as opposed to general 
propositional forms), and we have (P > Q) # (Q = P) as shown in the proof. But for 
some choices of the form P and the form Q, the logical equivalence will hold; for instance, 
suppose P and Q are both R. It can be checked that when P and Q are logically equivalent 
forms, (P > Q) = (Q => P), because in this case we must always be in row 1 or row 4 of the 
truth table shown in the proof. (In fact, using terminology from the next section, P > Q 
and Q = P are both tautologies in this situation.) For example, ((A V B) > (BV A)) is 
logically equivalent to ((B V A) = (AV B)). On the other hand, when P and Q are not 
logically equivalent, there must exist some assignment of truth values to the propositional 
variables appearing in P and Q that cause P and Q themselves to have opposite truth 
values. In this row of the truth table, P > Q and Q => P disagree, so that these forms are 
not logically equivalent. For instance, ((A V B) = (A@ B)) and ((A@ B) = (AV B)) are 
not logically equivalent, as we see by considering the row of the truth table where A and B 
are both true. 

(5) In algebra, if we know two variables represent the same number, we can replace 
one variable by the other in longer expressions. We can express this fact symbolically by 
substitution rules such as the following: for all numbers z, y, z, if y = z then x+y = «+z and 
xy = xz. There are analogous substitution rules for logically equivalent propositional forms. 
For instance, given any propositional forms A, B, and C, if B = C, then (AV B) = (AVC), 
(AA B) = (AAC), (A => B) = (A => C), (B => A) = (C = A), and so on. We freely use 
these substitution rules without justification in the main text, but it is possible to prove 
each rule by comparing truth tables. As an example, assume B = C, and consider a fixed 
row in the truth tables for A > B and A => C. Suppose that, for the values of the input 
propositional variables appearing in this row, A is true and B is false. On one hand, A > B 
must be false in this row. On the other hand, since $ and C were assumed to be logically 
equivalent, C is false in this row, so A => C is also false in this row. Thus, A > B and A > C 
have the same truth value (namely, false) in this row. Considering the other possible cases, 
we see that A > B and A => C have the same truth value in every row of the truth table, so 
they are logically equivalent. We can summarize the cases in a meta-truth table that looks 
like this: 


A B C\|ASB ASC 
TT T T T 
T F F F F 
FTT T T 
F F F T T 


Logic 15 


We call this a meta-truth table because the underlying propositional variables appearing 
in A, B, and C are not explicitly listed. (For example, these forms might involve five propo- 
sitional variables P, Q, R, S, and T, so that the actual truth table would have 32 rows.) 
However, to prove the logical equivalence of A = B and A => C, we only need to know the 
truth values of the propositional forms A, B, and C. Row 2 of the table is an abbreviation 
for the discussion preceding the table, and the other rows cover the remaining cases. The 
key point is that the table has four rows, not eight, because we have assumed that B and C 
are logically equivalent forms, hence always have the same truth value. You can construct 
similar meta-truth tables to prove the other substitution rules. 


Section Summary 


1. The logical operators satisfy the commutative, associative, distributive, idempo- 
tent, and absorption laws listed in the Theorem on Logical Equivalence. They 
also obey these negation rules: 


(PAQ) = (~P)V(~Q), MPV Q) = (~P) A (~Q), 
~(~P)= P, ~(P>Q)=PA(-Q). 
We check all these logical equivalences by verifying that the truth tables for each 
side agree in every row. 
2. P= Q is false when P is true and Q is false; P > Q is true in all other situations. 


3. The converse of P > Q is Q => P, which is not logically equivalent to P > Q. 
The contrapositive of P > Q is (~Q) = (~P), which is logically equivalent to 
P = Q. We have P > Q = (~P) V Q, and hence ~(P > Q) = PA (~Q). 


Exercises 


1. Make a truth table for each propositional form. 
(a) (PV (~Q)) +P (b) P= (Q^ (~P)) (c) (PEQ) => (PVQ). 
2. Make a truth table for each propositional form. 
(a) (P > Q)A(Q => R) (b) (P= Q)=> (QVR) (c) (~(P8Q)) > (~(RAP)). 
3. Prove the negation laws ~(~P) = P and ~(P AQ) = (~P) V (~Q) using truth 
tables. 
4. Use truth tables to prove the commutative laws for A, V, and @ in the Theorem 
on Logical Equivalences. 
Prove the associative laws for A and V. 
Prove the absorption laws in part (e) of the Theorem on Logical Equivalences. 
Prove the distributive laws in part (c) of the Theorem on Logical Equivalences. 
Prove or disprove: (~P) 6 (Q @ P) = (~Q). 
True or false? 
a) If Mars is a planet, then 2 + 2 = 4. 
b) If 2-2 44, then 7 is negative. 
c) If Paris is a city, then Europe is an ocean. 
d) If 0 = 0, then 5 is even. 
e) If some trees have roots, then all crows are white. 
ee 
f) 1f3+3£6, then | ay ot = 2/4. 
g) If 1+1 = 3, then 1+1 = 2, but the converse is false. 
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Assume P is a true proposition, Q is a false proposition, and R is an arbitrary 
proposition. Say as much as you can about the truth or falsehood of each propo- 
sition below. 

(a)P>R (b)QS>R (c) RSP A) R>=Q e) R>=R (ff) RS (WR). 
Let xo be a fixed integer, and let P be the statement: 

“if £o is even and zxọ is prime, then xp = 2.” 

a) Write the converse of P. 

b) Write the contrapositive of P. 

c) Write the converse of the contrapositive of P. 

d) Suppose zo is 8. Is P true or false? What about the converse of P? 


The inverse of P > Q is defined to be (~P) => (~Q). Decide whether the inverse 
of P > Q is logically equivalent to: (a) P = Q; (b) the converse of P > Q; 
c) the contrapositive of P > Q. 


Write the converse and the contrapositive of each statement. 
a) If f is continuous or monotonic, then f is integrable. 

b) If g is not continuous, then g is not differentiable. 

c) If p is prime, then if p is even, then p = 2. 


Use known logical equivalences to show that (P A R) => (~Q) is logically equiva- 
lent to ~(P A (Q A R)) without making a truth table. 


(a) Use known logical equivalences to show that P > (Q = R) is logically 
equivalent to (P A Q) > R without making a truth table. (b) Is (P > Q) > R 
logically equivalent to P > (Q = R)? Explain. 

Write the converse of (P > Q) = R and the contrapositive of (P > Q) > R. 


Use the Theorem on IF to replace each propositional form with an equivalent 
form that does not use =>. (a) (PAQ) = (RVS) (b) P=> ((~Q) > P) 


c) [P > (Q > R)] > [(P > Q) > (P= R)] 

Which proposed distributive laws are correct? Prove your answers. 
a) PV(QSR)=(PVQ)S(PVR). 

b) P> (QAR)=(P>Q)A(P>R). 

c) P>(QVR)=(P=Q)V(P= R). 

d) P>(Q@R)=(P]Q)O(PS= R). 

e) PA(Q=> R)=(PAQ)=> (PAR). 

f) PV(Q=> R)=(PVQ)=> (PVR). 

g) P@(Q> R)=(POQ)> (POR). 


Prove AA(BVCV DV E) = (AA B)V(AAC)V (AAD) V (AA E) without 
using truth tables. 


Use logical equivalences from this section to find four different IF-statements that 
are logically equivalent to (~P) V (~Q) V R. 

The FOIL Rule in algebra says that for all real numbers a, b, c, d, (a+b): (c+d) = 
(ac+ad)+(bc+bd). (a) Prove this rule using laws of algebra. (b) Formulate three 
analogous FOIL rules for logical connectives, and prove these rules without truth 
tables using the Theorem on Logical Equivalences. 


Use known logical equivalences (not truth tables) to prove: 
(A> B)A(C => D) = (Av C) v (4> D) v (C => B)V(BAD). 
True or false? Explain. 


(a) The contrapositive of P > Q always has the same truth value as P > Q. 
(b) The converse of P > Q always has the same truth value as the negation of 
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P=Q. 

(c) If P > Q is false, then the converse must be true. 

(d) If P > Q is true, then the converse must be true. 

(e) If the hypothesis and conclusion of a conditional statement have the same 
truth value, then the conditional statement must be true. 

Give specific examples of propositional forms A and 6 with each property below. 
Explain why your examples work. (a) A V B is logically equivalent to A © B. 
(b) A= B = B => A, but A> BF ANB. (c) AVB is logically equivalent to 
~(A^ B). 

Prove that logical equivalence of propositional forms is reflexive and transitive, 
as stated in (1.4). 


Prove the substitution rules for propositional forms given in Remark 5 on page 14. 
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1.3 IF, IFF, Tautologies, and Contradictions 


Now that we know what IF means, it is time to talk about other English words and phrases 
that have the same logical meaning as an IF-statement. We also introduce the new logical 
connective IFF (pronounced “if and only if”). In everyday life, people often say IF when 
they mean IFF, so it is vital to understand the correct technical usage of these words. We 
close the section by introducing tautologies (propositional forms that always evaluate to 
true) and contradictions (propositional forms that always evaluate to false). 


Translations of P > Q 


The most straightforward English translation of the propositional form P > Q is “if P, 
then Q.” However, there are many other English phrases with the same logical meaning 
as P > Q. We begin by considering some translations of IF that often cause trouble. By 
definition: 


e | P if Q| means Q => P|. 


e | P only if Q| means| P > Q|. 


The phrase “P if Q” is obtained by changing the order of the clauses in “if Q then P.” In 
both phrases, the proposition Q immediately follows the word IF, so Q is the hypothesis 
and P is the conclusion of the conditional statement. However, replacing IF by ONLY IF 
changes the meaning: in the sentence “P only if Q,” P is the hypothesis and Q is the 
conclusion! We explain this counterintuitive phenomenon through an example. Suppose a 
professor says to a student, “You will pass my class ONLY IF you take the final exam.” 
This statement means the same thing as, “If you do not take the final exam, then you will 
not pass the class.” We recognize this as the contrapositive of the statement, “If you pass 
the class, then you took the final exam.” The same reasoning holds in general: “P only if Q” 
really means “if Q is not true, then P is not true,” which can be encoded as (~Q) > (~P). 
This is the contrapositive of P > Q, so this statement is logically equivalent to P > Q. 
Now consider the meaning of the words NECESSARY and SUFFICIENT. By definition: 


e | P is sufficient for Q | means | P > Q|. 


e | P is necessary for Q | means | Q => P|. 


For example, “Differentiability of f is sufficient for continuity of f” means “If f is differ- 
entiable, then f is continuous.” On the other hand, the word NECESSARY (like the word 
ONLY) has a hidden negating effect. The statement “Taking the final exam is necessary 
for passing the course” means, “If you do not take the final exam, then you will not pass 
the course,” which in turn has the same meaning as, “If you pass the course, then you took 
the final exam.” In general, saying “P is necessary for Q” means that if P does NOT hold, 
then Q does NOT hold, which can be encoded as (~P) = (~Q). This is the contrapositive 
of Q => P, so this statement has the same logical meaning as Q => P. 

One way to remember the correct meaning of NECESSARY is to note that “P is nec- 
essary for Q” is the converse of “P is sufficient for Q.” Similarly, “P only if Q” is the 
converse of “P if Q.” We must take care, however, because the passive voice (and similar 
constructions that change the order of phrases in a sentence) can reverse the positions of P 
and Q. For example, all of the following phrases can be used to translate P > Q: 
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e P is sufficient for Q. 

e For Q to be true, it is sufficient that P be true. 

e A sufficient condition for Q is P. 

e Q is necessary for P. 

e A necessary condition for P is Q. 

e For P to be true, it is necessary that Q be true. 


Notice the key preposition FOR, which can help you locate the correct hypothesis and 
conclusion in each phrase. Yet other possible translations of P = Q include: 


e P implies Q. 
e Q is implied by P. 


e When P is true, Q is true. 


Q whenever P. 


e Q follows from P. 


Q is a consequence of P. 


In particular, when reading a form such as P > Q aloud, you may read the symbol = 
as the word IMPLIES. But it is wrong to read = as the word IF, since “P if Q” means 
Q => P, not P = Q. It is correct, but potentially confusing, to pronounce the symbol = as 
ONLY IF. My personal recommendation is to read P > Q as “if P, then Q.” But you must 
still be familiar with all of the variants above, since they occur frequently in mathematical 
writing. 

While we are discussing translation issues, we should mention the logical meaning of a 
few other common English phrases: 


e “P but Q” means PAQ. 

e “Although P, Q” means PAQ. 

e “P, Q are both true” means P A Q. 

e “Neither P nor Q” means (~P) A (~Q), or equivalently ~(P V Q). 


The words BUT and ALTHOUGH suggest some kind of contrast between the two clauses 
joined by these words, but this contrast is ignored in logic. For logical purposes, BUT and 
ALTHOUGH (and BOTH) have exactly the same meaning as AND. 

There are some other English phrases whose mathematical meaning does not seem to 
be completely standardized. I assert that “either P or Q” means P V Q (inclusive-OR), “P 
provided that Q” means Q => P, and “P unless Q” means P V Q, but not all mathematical 
writers agree with these conventions. 
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What IFF Means 


Now that we understand IF and its many synonyms, it is time to define the related logical 
connective IFF (pronounced “if and only if” and symbolized by =). The formal definition 
of this word appears in the truth table below. 


1.22. Definition of IFF. For any propositions P and Q, the truth value of P= Q 
(“P iff Q”) is determined by the following table. 


PQ | PQR 
T T T 
T F F 
F T F 
F F T 


Remember this table by noting that P <= Q is true exactly when the two inputs P, 
Q have the same truth values; P 4 Q is false when the inputs P, Q have different truth 
values. Comparing the defining truth tables for IF and IFF, we see that P> Q 4 PS&Q. 


1.23. Example. True or false? 

a) 1+1=3 only if1+1=2. 
b)0<1if1<0. 
c)0<1iff1<0. 
d) A necessary condition for 7 to be even is that 5 be odd. 
e) For 7 to be even, it is sufficient that 5 be odd. 
f) 5 is negative iff —5 is positive. 
Solution. The suggested approach is to encode each English sentence symbolically, then 
refer to the definitions of IF and IFF. Thus, (a) becomes “1 +1 = 3 > 1+1 = 2,” which is 
true since F > T is true. Similarly, (b) becomes “1 < 0 = 0 < 1,” which is true. But (c) is 
“0 <14 1x< 0,” which is false since T © F is false. (d) says that “7 is even => 5 is odd,” 
which is true. (e) says that “5 is odd = 7 is even,” which is false. Finally, (f) is true since 
both inputs to the © operator are false. 


1.24. Theorem on IFF. For all propositional forms P and Q: 

(a) Reduction to IF: P & Q = (P> QA (Q =P). 

(b) Reduction to AND and OR: P & Q = (PAQ) V ((~P) A (~Q)). 

(c) Symmetry of IFF: P& Q=Q<P. 

(d) Denial of IFF: ~(P = Q) = PEQ. 

(e) Negating Both Sides: P & Q = (~P) © (~Q). 

Part (a) relates IFF to IF and explains why the symbol © is used for IFF. Part (b) lets us 


replace IFF by a form built from other logical connectives. 


As with previous theorems, we prove these logical equivalences with truth tables. As a 
sample, we prove parts (a) and (e) as shown: 


P Q|PQ P>Q Q>P (P>Q)AQ>P)|~P_~Q_(~P) S (~Q) 
TT T T. T T F F T 
T F| F F T F F T F 
FT| F T F F T F F 
F F| T T T T T T T 


Columns 3 and 6 agree, proving (a); columns 3 and 9 agree, proving (e). 
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1.25. Translations of IFF. We can translate P © Q by several English phrases, including: 
(a) P iff Q. 

(b) P if and only if Q. 

(c) P is necessary and sufficient for Q. 

(d) P when and only when Q. 

(e) P precisely when Q. 

(f) P is equivalent to Q. 

(g) P has the same truth value as Q. 

Regarding (b), note that “P if and only if Q” is short for “P if Q, and P only if Q,” 
which translates to (Q => P) A (P => Q). We know this is equivalent to P = Q by part (a) 
of the theorem and commutativity of A. A similar analysis applies to the phrases “necessary 
and sufficient” and “when and only when.” 

From part (e) of the Theorem on IFF, negating both sides of an IFF-statement does not 
deny the entire statement, but actually produces a logically equivalent statement. To deny 
an IFF-statement, we must replace the IFF by XOR, as stated in part (d) of the theorem. 
By negating each side of the equivalences in the Theorem on IFF, or by constructing more 
truth tables, we obtain the following equivalences for XOR. 


1.26. Theorem on XOR (Optional). For all propositional forms P, Q, R: 
(a) Reduction to AND and OR: PẸ Q = (PA (~Q)) V (QA (~P)). 

(b) Negating Both Sides: P Q = (~P) @ (~Q). 

(c) Second Reduction to AND and OR: PEQ = (PV Q)A ((PA Q)). 
(d) Commutativity of XOR: PEQ = QP. 
( 
( 
( 


e) Denial of XOR: ~(P®Q) = PSQ. 
f) Reduction to IFF: PẸ Q = PS (Q) = (~P) Q. 
g) Associativity of XOR: P E (Q9 R) = (PEQ) R. 


Part (a) shows that we can translate P ® Q as “exactly one of P and Q is true.” Part (c), 
which we proved in an earlier section, justifies the translation of P 6 Q as “P or Q, but 
not both.” Part (f) indicates how to transform XOR-statements into IFF-statements by 
negating one of the inputs. 


Tautologies and Contradictions 


Certain propositional forms, called tautologies and contradictions, play a special role in 
logic and proofs. These are defined as follows. 


1.27. Definition: Tautologies and Contradictions. A propositional form A is called a 
tautology iff every row of the truth table for A has output true. A propositional form $ is 
called a contradiction iff every row of the truth table for B has output false. 


Note that most propositional forms have a mixture of true and false outputs, so most 
propositional forms are neither tautologies nor contradictions. 


1.28. Example. The propositional form RV (~R) is a tautology, whereas the form RA (~R) 
is a contradiction, as shown in the following truth table: 


R| ~R RV (~R) RA(~R) 
T| F T F 
FÍT T F 
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Intuitively, “R or not R” must be true, regardless of the truth value of R, whereas 
“R and not R” must be false. More generally, for any propositional form A (not just the 
individual variable R), AV (~A) is a tautology, whereas A A (~A) is a contradiction. 


1.29. Example. Classify each propositional form as a tautology, a contradiction, or neither. 
(a) P+(QSP) b) (P>Q)>P (0) (PAQ)A(PSQ) 

Solution. In each case, we make a truth table for the given form and see if every output 
is true (for a tautology) or false (for a contradiction). The truth table for (a) and (b) is 
shown here: 


P Q|Q>P Ps(Q=P)|PsQ (P=>=Q) >P 
T T T T T T 
T F T T F T 
F T F T T F 
F F T T T F 


Column 4 is true in every row, so (a) is a tautology. Column 6 has a mixture of Ts and 
Fs, so (b) is neither a tautology nor a contradiction. For (c), we make the following truth 
table: 


P Q[P6Q PQ (PEQAPSO 
T T F T F 
T F| T F F 
F T) T F F 
F F| F T F 


The final output is always false, so (c) is a contradiction. 


Section Summary 


1. All of the following phrases mean P => Q: if P then Q; P implies Q; Q if P; 
P only if Q; when P, Q; Q whenever P; P is sufficient for Q; Q is necessary for 
P; and other variations involving passive voice or changes in word order. 


2. P & Q (“P iff Q”) is true when P and Q are both true or both false; P = Q 
is false when exactly one of P and Q is true. Some useful equivalences involving 
IFF include: 


PSQ = (P>Q)A(Q> P) PSQ = (PAQ)V((~P)^(~Q)), 
P&Q= (~P) (QQ) = QSP, (PSQ)=PeQ. 


3. “P only if Q” is the converse of “P if Q.” “P is necessary for Q” is the converse 
of “P is sufficient for Q.” So, P 4 Q can be translated “P if and only if Q” or 
“P is necessary and sufficient for Q.” Other possible translations include: 
P iff Q; P precisely when Q; P is equivalent to Q. 

4. In logic, the words BUT, ALTHOUGH, and BOTH have the same meaning as 
AND. “Neither P nor Q” means (~P) A (~Q). 


5. A tautology is a propositional form that has output true in all rows of its truth 
table. A contradiction is a propositional form that has output false in all rows of its 
truth table. Most propositional forms are neither tautologies nor contradictions. 
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Exercises 
1. Use truth tables to prove parts (b), (c), and (d) of the Theorem on IFF. 
2. Which expressions below are logically equivalent to the statement “if P then Q”? 
P=Q Q=>P PQ 
P implies Q. (~P) = (~Q) (~Q) => (~P) 
(~P) & (~Q) (PAQ)V (WP) A(~Q)) (~P)VQ 
~(P@Q). PifQ. Q if P. 
P iff Q. P is sufficient for Q. P is necessary for Q. 
P is nec. and suff. for Q. (P=>Q)A(Q=> P) QSP 
3. Which expressions listed in the previous exercise are logically equivalent to the 
statement “P if and only if Q”? 
4. Use truth tables to classify each propositional form below as a tautology, a con- 
tradiction, or neither. 
(a) P< (~P) (b) P=>(~P) (c) (P®Q) @ ((~P) Q) 
(d) QA (~(P = Q)). 
5. Is each propositional form a tautology, a contradiction, or neither? Explain. 
(a) (PAQ)>Q_ (b) (PVQ)+Q 
(c) (Q => (PVQ)) (d) (P eQ) e (Qe P) 
(e) (PS Q) A (P S (~Q)) (H P > Q)^ (P > (~Q)). 
6. Rewrite each statement, replacing all logical connective words (such as IF, BUT, 
SUFFICIENT, and so on) with appropriate logical symbols (such as =). 
a) If £o is even but prime, then xo = 2. 
[Answer. “(ap is even A zo is prime)=> xo = 2.”] 
b) f is continuous only if f is bounded. 
c) x is real whenever z is rational. 
d) A necessary condition for m to be perfect is that m be even. 
e) X is compact iff X is pow closed and bounded. 
f) Although x > 2 implies z? > 4, the converse is not true. [Don’t use the word 
“converse” in your answer.| 
g) When f has compact domain, continuity of f is equivalent to uniform conti- 
nuity of f. 
h) Continuity of f is sufficient but not necessary for integrability of f. 
7. Write seven different English sentences with the same meaning as “If AC and 
BC are equal in length, then ZA = ZB.” 
8. Write five different English sentences equivalent to “xo is odd iff xo + 3 is even.” 
9. Rewrite each statement in the standard form “If P, then Q.” 
a) G cyclic implies G is commutative. 
b) Normality of X is sufficient for regularity of X. 
c) Measurability of f is necessary for continuity of f. 
d) f is integrable only if f is bounded. 
e) For K to be path-connected, it is necessary that K be connected. 
f) A sufficient condition for T to be linear is that T be orthogonal. 
10. True or false? (It may help to first rewrite complex statements in symbolic form.) 
a) Columbus discovered America in 1942 iff World War I ended in 1981. 
b) Mongolia is a vegetable if Paris is a city. 
c) Mongolia is a vegetable iff Paris is a city. 
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d) Snakes are not reptiles whenever 2 + 2 = 4. 

e) 3 is neither even nor positive. 

f) Grand pianos are heavier than oboes only if cannonballs are lighter than 
feathers. 

g) For Venus to be a planet, it is necessary that 1 + 1 = 3. 

h) 1+1 = 2 is a sufficient condition for anvils to float. 

i) Although whales are mammals, they live in the ocean. 

j) January has 32 days only if March has 31 days. 

k) For Hillary Clinton to be 45th President of the U.S., it is sufficient that George 
Washington was the 1st President of the U.S. 


For each sentence below, write a logically equivalent sentence using the word 
NECESSARY. 

a) If x is rational, then z is real. 

b) For f to be continuous, it is sufficient that f be differentiable. 

c) det(B) Æ 0 implies B is invertible. 

d) G is cyclic only if G is commutative. 

e) A sufficient condition for T to be isosceles is that T be equilateral. 


For each sentence below, write a logically equivalent sentence using the phrase 
ONLY IF. 

(a) Whenever n ends in the digit 8, n is not prime. 

b) If F is a field, then F is an integral domain. 

c) A necessary condition for a sequence (an) to converge is that (an) be bounded. 
d) Having a zero column is sufficient for a matrix B to be singular. 

e) K is compact or K is connected. 

a 


) Prove that IFF is transitive by showing that [(P © Q)A(Q © R)| > (Ps R) 
is a tautology. (b) Is the converse of (a) a tautology? Explain. 


( 
( 
( 
( 
( 


Is IFF associative? In other words, is (P = Q) © R logically equivalent to 
P & (Q & R)? Explain. 


True or false? 

a) For 57 to be negative, it is necessary and sufficient that George Bush was the 
1st President of the United States. 

b) Peking is the capital of Venezuela only if Boise is the capital of Idaho. 

c) 3 is odd if and only if the positivity of 7 implies that 9 is even. 

d) Whenever pigs can fly or fish have gills, it is false that money grows on trees 
iff Los Angeles is located in California. 

e) If R is paracompact, then red is a color. 


True or false? Explain. 

a) Every propositional form is either a tautology or a contradiction. 

b) The negation of P © Q is logically equivalent to “P or Q.” 

c) ~(P => Q) is logically equivalent to (~P) @ (~Q). 

d) “P is necessary for Q” is logically equivalent to the converse of “P is sufficient 
for Q.” 

e) “P only if Q” is logically equivalent to the contrapositive of “Q if P.” 


a) Prove parts (a) and (b) of the Theorem on XOR using the Theorem on IFF and 
known logical equivalences (not truth tables). (b) Prove the rest of the Theorem 
on XOR by any convenient technique. 


Suppose A, B, and C are propositional forms such that B = C. (a) Prove the 
substitution rule (A = B) = (A & C) using a meta-truth table (see Remark 5 on 
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page 1.2). (b) Deduce from (a) that (B = A) = (C © A) without using a truth 
table. 


Let A and B be any propositional forms. (a) Suppose A = B. Show that A= B 
is a tautology. (b) Now suppose A & B is a tautology. Show that A= B. 


For each propositional form, find a logically equivalent form that only uses the 
logical symbols ~ and V. (a) P>Q (b)~(PAQ) (C) PAQ (da) PEQ. 


For each propositional form, find a logically equivalent form that only uses the 
connectives ~ and A. (a) ({P>Q) ÞP>Q (c)PVQ (d)PSQ 
(e) PEQ. 

For each propositional form, find a logically equivalent form that only uses the 
connectives ~ and >. (a) PVQ (b)PAQ (ce) PSQ_ (QA PEQ. 


Write the converse and the contrapositive of each statement. Give the truth value 
of the statement, its converse, and its contrapositive. 

a) If milk is a beverage, then chowder is a soup. 

b) If 9 is negative, then 10 is odd. 

c) If 1+1 = 5, then the cosine function is continuous. 

d) Sugar is sweet if lemons are sour. 

e) A necessary condition for pigs to fly is that eagles oink. 

f) For m to be negative, it is sufficient that —1000 be positive. 

g) An argon atom is lighter than a helium atom whenever a sodium atom is 
heavier than a hydrogen atom. 

h) If 7 is positive iff 8 is odd, then 9 is a square iff 10 is even. 

i) The oddness of 13 implies that 5 > 11 only if the liquidity of water at room 
temperature is a sufficient condition for fish to have lungs. 


Show that any propositional form using only the logical connectives ~ and ® 
cannot be logically equivalent to P A Q. 
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1.4 Tautologies, Quantifiers, and Universes 


Elementary logic has two main parts: the logic of propositions and the logic of quantifiers. 
After a few more examples to round out our treatment of propositional logic, we begin our 
discussion of quantifiers in the second part of this section. Intuitively, quantifiers allow us 
to extend the propositional connectives AND and OR to apply to variables that can range 
over potentially infinite sets of values. 


More on Tautologies and Contradictions 


Recall that a tautology is a propositional form that is always true, regardless of the truth 
values of the variables appearing in the form; similarly, a contradiction is a propositional 
form that is always false. Recall also that two propositional forms are logically equivalent 
iff they have the same truth value for all possible values of the input variables. 


1.30. Example. Show that the following propositional form is a tautology: 
(P & Q) > (PAR) & (QA R)). 


Solution. Call the given propositional form A. We make an eight-row truth table, shown 
here: 


PSQ PAR QAR (PAR S(QAR) 


hy fy LD 
=a Aa HA Ho 
Maa yal 
HHA aad 
HHHHHHHH S 


HAA 
gs ies ieo ies ies iz Bes ia! 
gs ieo ies i ies ies Bes ia! 


We fill in the Ts in columns 5 and 6 by looking for two Ts in the appropriate input 
columns; the remaining entries in these columns are Fs. We fill in columns 4 and 7 by looking 
for agreement between the two input columns, putting Ts in these rows, and putting Fs 
elsewhere. Finally, we fill column 8 by looking for places where column 4 is true and column 
7 is false. No such places exist, so column 8 is all Ts. This shows that A is a tautology. 


1.31. Example. Let A be any propositional form, and let C be any contradiction. 

(a) Show that AAC is a contradiction. 

(b) Show that AVC = A. 

(c) Show that ((~A) = C) > A is a tautology. 

Solution. We cannot make the actual, complete truth tables for the propositional forms 
appearing here, since we do not know which specific propositional variables appear within 
the forms A and C. However, we can still make a table showing the possible truth values 
of various expressions built from A and C. The key point is to recall that C must be false, 
whereas (without further information about A), A might be true or false. So we prepare 
the following two-row table: 


A_C|[AAC|AVC|~A (ASC) (AS ORDA 
T F F T F T T 
F F F F F T 
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For any assignment of truth values to the propositional variables appearing within A 
and C, A must be true or false, whereas C must be false. Column 3 therefore proves that 
AAC must also be false in all situations, so this form is a contradiction. Comparison of 
columns 1 and 4 shows that AVC always has the same truth value as A, hence these two 
forms are logically equivalent. Finally, the last three columns show that the form in (c) is 
always true, so this form is a tautology. We return to this tautology later, using it to justify 
a method of proof called proof by contradiction. 


1.32. Remark. By arguments similar to those in the last example, you can establish the 
following facts. Let A and B be any propositional forms, and let 7 be any tautology. 

(a) AAT =A. 

(b) AV T is a tautology. 

(c) A is a tautology iff ~A is a contradiction. 

(d) A= Biff A & B is a tautology. 

Part (d) lets us harvest many tautologies from the logical equivalences proved earlier. For 
instance, the logical equivalence P ^A Q = QA P leads to the tautology (P AQ) = (QAP). 


Variables and Quantifiers 


Quantified logic studies the truth or falsehood of statements containing variables such as x 
and y. To see how this differs from propositional logic, consider the following example. 


1.33. Example. True or false? 

(a) For all real numbers x, x? = g. 

(b) There is a real number g satisfying x? = x. 

(c) z? = x (where z is a variable varying over all real numbers). 

(d) 22 = xo (where zo is a constant, designating one particular real number). 

Intuitively, (a) is false, since the real number x = 2 does not satisfy x? = x (as 2? = 4 £ 2). 
Similarly, (b) is true, since 2 = 1 satisfies 1? = 1 (note x = 0 works also). Statement (c) is 
a bit problematic: if the variable x has not been assigned a particular value, but is varying 
through all real numbers, then (c) has no truth value (it is not a proposition). Put another 
way, (c) is true for some values of the variable x and false for other values of x, so statement 
(c) has no definite truth value of its own. In contrast, the constant zo in (d) represents one, 
particular, fixed real number. So x2 = xo is a proposition, although we cannot tell without 
more information about xo whether this proposition is true or false. 


In general, a statement containing a variable x, such as statement (c) above, is called 
an open sentence and is denoted by a symbol such as P(x), Q(x), etc. Open sentences can 
involve more than one variable; for instance, we could let R(x, y) be the formula x7+y? = 25. 
An open sentence does not have a truth value; it is not a proposition. One way to turn an 
open sentence into a proposition is to assign specific values to every variable appearing in it. 
In our example, R(5,0) is the true proposition 5? + 0? = 25; R(2, 4) is the false proposition 
2? + 4? = 25; R(3,—4) is true; R(7,10) is false; and so on. R(3,z) is the open sentence 
3? + 2% = 25 involving the new variable z. We can also substitute letters that represent 
constants into an open sentence. In this text, the letters a,b,c and letters with subscripts 
(like xq) denote constants. Thus R(xo,b) is the proposition zê + b? = 25, but the truth 
value of this proposition cannot be determined without more information about x9 and b. 

There are two other ways to turn an open sentence P(x) into a proposition, which are 
illustrated in (a) and (b) above: we may quantify the variable x with a phrase such as “for 
all x” or “there exists an x.” We use the universal quantifier symbol V to abbreviate “for 
all,” and the existential quantifier symbol 3 to abbreviate “there exists.” These symbols are 
explained further in the next two definitions. 
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1.34. Definition of V (Universal Quantifier). Let P(x) be an open sentence involving 
the variable x. The statement “Vz, P(x)” is a proposition that is true iff for all objects zo, 
P(xo) is true. The quantifier Vz may be translated “for all x,” “for every x,” “for each x,” 
“for any x,” among other possibilities. The symbol V looks like an upside-down capital A, 
reminding us of the word ALL. 


1.35. Definition of 3 (Existential Quantifier). Let P(x) be an open sentence involving 
the variable z. The statement “3x, P(x)” is a proposition that is true iff there is at least one 
object zo making P(xo) true. The quantifier Jx may be translated “there is an x,” “there 
exists an x,” “there is at least one x,” “for some x,” among other possibilities. The symbol 
J looks like a backwards capital E, reminding us of the word EXISTS. 


Universes; Restricted Quantifiers 


In most discussions, we do not want to be making statements about literally all objects 
x. Instead, we want to restrict attention to a particular set of objects, such as the set of 
real numbers, or the set of integers, or the set of continuous real-valued functions. A set U 
of objects that contains all objects we want to discuss in a particular situation is called a 
universe or universal set. We write x € U to mean that x is a member of the universal set 
U, and we write x ¢ U to mean that x is not in U. (This notation comes from set theory, 
which we study in Chapter 3.) By combining this notation with quantifiers, we obtain the 
restricted quantifiers defined next. 


1.36. Definition: Restricted Quantifiers. Let U be any set, and let P(x) be an open 
sentence involving x. The statement “Vx € U, P(x)” means that for every object zo in U, 
P(ao) is true. The statement “Ja € U, P(x)” means there exists at least one object zo in 
U for which P(ao) is true. The symbols “Vz € U” and “Jx € U” are restricted quantifiers; 


in contrast, the previous symbols “Va” and “Jx” are unrestricted quantifiers. 


Next we introduce some notation for certain universes of numbers that arise frequently 
in mathematics. 


1.37. Definition: Notation for Number Systems. 

(a) Z is the set of all integers, namely Z = {...,—3, —2, —1, 0, 1,2,3,...}. 

(b) Q is the set of all rational numbers, which are ratios a/b where a is an integer and b is 
a nonzero integer. 

(c) R is the set of all real numbers. 

(d) C is the set of all complex numbers, which are expressions a+ bi with a, b real, and where 
i satisfies i? = —1. 

(e) We write Z>o = {0,1,2,3,...} for the set of nonnegative integers, and Z>o = {1,2,3,...} 
for the set of positive integers. Some authors use N (the set of natural numbers) to denote 


the set Z>9, whereas other authors use N to denote the set Z.9. To avoid confusion, we do 
not use the symbol N except in §8.3. 


(£) By analogy with (e), let Ryo be the set of strictly positive real numbers (not including 
zero). For a fixed integer b, let Z>, be the set of integers that are at least b, and let Z>, be 
the set of integers strictly greater than b. We similarly define Zz»), Ryz,, R<z,, and so on. 


(g) Given n distinct objects 71, £2,..., £n, let U = {£1, £2,..., £n} denote the finite uni- 
versal set consisting of these n objects. 


This definition is merely setting up notation. In Chapter 8, we give more precise defini- 
tions as part of a formal development of the number systems Z, Q, R, and C. 
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. Example. True or false? 
z£EZ,c£+3=2. 

z E€ Z>0o,£ +3=2. 

R, z? = —1. 

z eC, r? =—-1. 
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Jx € Z, g? = q. 

(j) Yz € {-1,0,1}, 23 = g. 

Solution. First, (a) is true, since x = —1 is in Z and satisfies x + 3 = 2. In fact, x = —1 is 
the only real solution of this equation. So (b) is false, since —1 is not in the universe Z>o. 
Similarly, (c) is false, since the square of any real number is at least 0, so cannot equal —1. 
But (d) is true, since x = i is a complex number making the equation true (x = —i also 
works). Adding 5 to any positive integer produces an integer 6 or greater, which is also 
greater than 3, so (e) is true. But (f) is false, since z + 5 > 3 fails to hold for the rational 
number x = —11/2 (among many other examples). Similarly, (g) is false. Next, (h) is false 
since 2 € Z and 2? = 2 is not true. But (i) is true, since 1 is an integer such that 1° = 1. 
Also (j) is true, since each of the three objects b in the finite universe {—1, 0, 1} does satisfy 
b? = b. These examples show that changing to a different universe can affect the truth of 
a statement with a restricted quantifier. The type of quantifier (4 or V) certainly makes a 
big difference as well. We also remark that it would not make sense to use unrestricted 
quantifiers in statements (a) through (j). For, if x is an arbitrary object (not necessarily a 
number), it is not clear what statements such as x + 3 = 2 and x? = x should mean. 


Quantifiers vs. Logical Operators 


The quantifiers V and 4 can be viewed as generalizations of the logical operators ^ 
(AND) and V (inclusive-OR), respectively. To see why, consider a finite universe U = 
{21, Z2,---,2n}. The universal statement “Va € U, P(x)” is true iff P(a) is true for each 
fixed object a € {z1,...,2n}. In other words, “Vx € U, P(x)” is true iff P(z1) is true and 
P(z2) is true and ... and P(z,) is true. Similarly, “da € U, P(x)” is true iff there exists an 
object b € {z1,...,2n} making P(b) true, which holds iff P(z1) is true or P(z2) is true or 
... or P(Z,) is true. In summary, 


Va E {21, 22,---;2n}, P(x) = P(z1) A P(z2) A+++ A Plén) § 


dx € {21, 22,..-,2n}, P(x) = P(z1) V P(z2) V -++ V P(Zn) | 


In the general case of an infinite universe U, “Vx € U, P(x)” is a statement that (informally 
speaking) AND’s together the infinitely many statements P(a) as a ranges through all 
objects in U. Similarly, “Sa € U, P(x)” is a statement that OR’s together all the statements 
P(b) for b ranging through U. This analogy linking V with A and 4 with V can help us 
understand and remember some of the quantifier properties presented later (such as the 


negation rules in the next section). 


Section Summary 


1. Given any tautology 7, any contradiction C, and any propositional form A, 
AAT =A=AVC; AVT is a tautology; AAC is a contradiction. For any forms 
A and B, A=B iff A & B is a tautology. 
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2. The universal quantifier V and the existential quantifier 4 have the following 


meaning. 
Symbol | Meaning 
Va, P(x) For all objects xo, P(x) is true. 
Va € U, P(x) | For all objects xo in U, P(x) is true. 
dx, P(x) There exists an object xo such that P(xo) is true. 


Jx € U, P(x) | There exists an object xo in U such that P(ao) is true. 
Words such as “all,” “any,” “each,” and “every” signal the use of V; phrases such 
as “there is,” “there exists,” “for some,” and “at least one” signal the use of 4. 
Common choices of the universal set U include Z (integers), Q (rational numbers), 
R (real numbers), Z>o (nonnegative integers), Z>o (positive integers), and finite 
sets. 


3. For finite universes, quantifiers can be replaced by propositional logic operators 
as follows: 


Va E {21, 22,.--,2n},P(x@) & Pla 
Jr € {z1,22,..-,2n},P(x) & Pla 


Sn 


JA a 
)V P(z2) V-V P(zn). 


Exercises 


1. Is each propositional form a tautology, a contradiction, or neither? Explain. 
a) (P = (Q => R) A (P> Q)A (PA (~R)). 

b) (PVR) (QV R)) > (PS Q). 

c) (P & Q) => ((R => P) © (R= Q)). 

2. Is each propositional form a tautology, a contradiction, or neither? Explain. 
a) (PVQA(P => RAQ => RSR. 

b) (P+ Q) © R] $ [P $ (Qe R)|. 

c) [P > (Q => R) > [(P > Q) > (P > R)]. 

3. Prove Remark 1.32. 


4. Prove: for any propositional forms A and B, if A & B is a tautology, then A > B 
is a tautology. Is the converse always true? 


5. Let T be a tautology, C be a contradiction, and A be any propositional form. 
Prove: (a) A => T is a tautology. (b) ASC = A. (c) AS T = (~A). (I) TSC 


is a contradiction. 


6. Rewrite each sentence using symbols, not words. (a) The square of any real num- 
ber is nonnegative. (b) There is an integer whose cube is two. (c) Every positive 
rational number has the form m/n for some positive integers m and n. 


7. Rewrite each statement using words, not symbols. 
a) Vz € Q, £? 47. (b) Jy € R,Wv € R, yv =v. (c) Ja € Z, Ib € Z, 25 = a? + b’. 
8. True or false? (a) Ix € Z, x? = 2. (b) Ix € R, x? = 2. (c) Yz € Q, x? > 0. 

d) Vx € Z, £? > x. (e) Vx € R, x? > x. (f) Jx € Z, z? — 3x — 10 = 0. 

9. True or false? Explain. 

a) Vx € {3,5,7}, x is an odd integer. 

b) Yx, x € {3,5,7} > a is an odd integer. 

c) Yx, x € {3,5,7} A z is an odd integer. 

d) Jx € {3,5,7}, x is an even integer. 
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e) 3x, x € {3,5,7} A z is an even integer. 
f) 3x, x € {3,5,7} > z is an even integer. 


True or false? Explain. 

a) Va € Z, 4x = 7. (b) Yx, x € Z > 4g = 7. (c) Yx, EZA 4r =7. 

d) Jx € Z, 4x = 7. (e) Ix, x € ZA 4x = 7. (f) Ix, x € Z > 4r = 7. 

For which universes U is the statement Vx € U, dy € U, x + y = 0 true? 

a) Z>o (b)Z (c)Q (AR (e){0} (£) {-1,0,1} 

For which universes U is the statement Va € U, Jy € U, xy = 1 true? 

a) Z>o (b) Q>o (c) R>o (da)Z (eR (f {-1} 

a) Rewrite the statement Yx € {—1,0, 1}, x3 = x without using quantifiers. (b) 
Rewrite the statement Jx € {1,2,3}, £? = 2x without using quantifiers. 

c) Rewrite the statement “11 is prime and 13 is prime and 17 is prime and 19 is 
prime” using a quantifier instead of the word AND. 


Let xo be a fixed real number, and let x be a variable ranging through real 
numbers. (a) Explain the logical difference between these two sentences: I. “if 
xo > 0, then x2 > zo.” I. “if x > 0, then z? > x.” (b) One of the sentences in (a) 
is not a proposition. Use quantifiers to turn this sentence into a proposition in 
two different ways, and state whether these propositions are true or false. (c) Do 
the truth values of the propositions in (b) change if we restrict x to vary through 
Z instead of R? 


(a) Express the statement Vx, P(x) using only 3 and propositional logic operators. 
(b) Express the statement Jx, Q(x) using only V and propositional logic operators. 


Suppose U and V are sets such that every member of U is also a member of V. 
Which statements below must be true for every open sentence P(x)? For those 
that are true, explain why. For those that are false, illustrate with a specific choice 
of U, V, and P(x). 

(a) Yx € U, P(x) implies Yz € V, P(a). 

(b) Va € V, P(x) implies Vx € U, P(x). 

(c) da € U, P(x) implies 3x € V, P(x). 

(d) da € V, P(x) implies 3x € U, P(x). 

We know Q V (~Q) is a tautology. Which statements below must be true for all 
choices of U and P(x)? Explain. 

a) Vz € U, [P(x) v (~P(«))].- 

b) (Va € U, P(x)) V (Va € U,~P(2)). 

c) (Va € U, P(x))V ~(Va € U, P({x)). 


We know Q A^ (~Q) is a contradiction. Which statements below must be false for 
all choices of U and P(x)? Explain. 

a) Jx € U, [P(x) A (~P(x))]. 

b) (dx € U, P(x)) A (Ar € U, ~P(x)). 

c) (Ex € U, P(x))A ~(aa € U, P(x)). 

True or false? Explain each answer. 

a) For all propositional forms A and all contradictions C, C = A is a tautology. 
b) For all propositional forms A and B, A => B is not logically equivalent to 
B > A. 

c) Any two contradictions are logically equivalent. 

d) For all logically equivalent forms A and B, (A > B) = (A & B). 


Show that any propositional form using only the logical connectives ^A and V is 
neither a tautology nor a contradiction. 
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1.5 Quantifier Properties and Useful Denials 


This section develops more properties of the existential quantifier J and the universal quan- 
tifier V. We look at rules for converting restricted quantifiers to unrestricted quantifiers, 
translation examples illustrating how English phrases encode quantifiers, and rules for 
transforming negated quantified statements. We then discuss how to find useful denials 
of complex statements, which is a crucial skill needed to analyze and create proofs. 


Conversion Rules for Restricted Quantifiers 


We start with some rules for eliminating restricted quantifiers. Let U be any set, and consider 
the statement Jx € U, P(x). This statement is true iff there is an object zo in the set U for 
which P(2 9) is true. Intuitively, this condition holds iff there is an object xo (not initially 
required to be in U) for which “xp € U and P(xo)” is true. Therefore, we obtain the rule 


da € U, P(x) iff Jz, (x € U A P(x)) 


for converting a restricted existential quantifier to an unrestricted existential quantifier. The 
next example shows that this rule does not work if we blindly replace both 3 symbols by V 
symbols. 


1.39. Example. Let U be the set of polar bears, and let P(x) be the open sentence “x 
is white.” The two statements da € U, P(x) and 3x, (x € U A P(x)) both assert that a 
white polar bear exists. Similarly, Vz € U, P(x) says that all polar bears are white, which 
(ignoring evolutionary anomalies) is a true proposition. But Vz, (x € U A P(x)) means 
something entirely different: this statement says that every object x is a white polar bear! 

We can fix the example by replacing A (AND) by = (IMPLIES). Consider the statement 
Va, (x € U => P(x)). This says that for all objects xo, IF xo is a polar bear, THEN Zo is 
white. Imagine testing the truth of this statement by considering each object xo in the 
whole universe, one at a time. Some objects xp are polar bears, and for these objects the 
IF-statement is true (having a true hypothesis and true conclusion). All remaining objects 
Xo are not polar bears, and for such objects the IF-statement is automatically true (since 
its hypothesis is false). Thus these objects are essentially irrelevant when determining the 
truth of the quantified statement. 


The rule suggested by the example holds in general: 


Va € U, P(x) iff Va, (x € U => P(a))}. 


Intuitively, if the left side is true, then the implication on the right side must be true for 
each fixed object xo satisfying the hypothesis x) € U. But the implication is always true 
for all other objects xg (those making the hypothesis xo € U false), so the right side is true. 
Similarly, if the right side is true, then applying the right side to each object x9 € U shows 
that P(xo) is true for all such objects zo. So the left side is also true. Later, after discussing 
proof methods, we will see that this paragraph is essentially a proof of the conversion 
rule stated above. We also remark that a major reason for defining the IF truth table as 
we did was to make sure that this quantifier rule would work (compare to our discussion 
of the statement Vz € R,x > 2 > x? > 4 on page 11; this statement is equivalent to 
Va € Ryo, £? > 4). 
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Translation Examples 


Here are some examples illustrating the process of translating back and forth between En- 
glish statements and logical symbolism. We use these universes and open sentences: U is 
the set of roses; V is the set of violets; C is the set of carrots; R(x) means x is red; P(x) 
means x is purple; O(a) means x is orange; and B(x) means z is beautiful. In the first few 
examples, we encode English statements, first using restricted quantifiers, and then con- 
verting to unrestricted quantifiers. 


(a) “All violets are purple.” The word “all” signifies a universal statement, so we write 
Va € V, P(x). Eliminating the restricted quantifier gives Va, (x € V > P(x)). 


(b) “Some roses are orange.” Here, “some” indicates an existential statement, so we get 
da € U,O(x), which in turn becomes Jx, (x € U A O(x)). Note that the logical statement 
would be true even if there were only one orange rose, despite the use of the plural “roses” 
in the English statement. 


(c) “Every carrot is orange or purple.” We encode this as Va € C, (O(a) V P(x)) or equiva- 
lently Vz, (x € C => (O(x) v P(a))). 


(d) “There is a purple and orange carrot.” We encode this as dr € C, (P(x) A O(x)) or 
equivalently Jx, (x € C A P(x) A O(x)). On the other hand, “Purple carrots exist, and 
orange carrots exist” is a different statement, which could be encoded as 


(ax, (x ECA P(x))) A (Ay, (y E CA O(y))). 


(e) “Any red rose is beautiful.” We have not introduced a universe of red roses, but we 
can use unrestricted quantifiers to write Vz,(R(x) Ax € U) = B(x). We could also say 
Va € U, (R(x) > B(a)). 

(£) “If purple carrots exist, then each white rose is beautiful.” In this sentence, two quanti- 
fiers appear within an IF-THEN construction. One possible encoding is 


dr, (x € C A P(x))| = Ny, (Wy) Ay € U) = B(y)). 


Note that adjectives modifying a noun produce A symbols in the encoding, whether the 
noun is existentially or universally quantified. 


(g) “a +y = y+ for all real numbers x and y.” Although English grammar allows a 
quantifier to appear after the variable it quantifies, formal logic requires the quantifier to 
come first. Thus we write Vz € R, Vy € R,x + y = y + x. Note that the word “and” in this 
sentence does not translate into the logical operator A, but allows the single phrase “for 
all” to stand for two universal quantifiers. 


(h) “n = 2k for some integer k.” Here too, the quantifier at the end of the English sentence 
must be moved to the front in the logical version, giving dk € Z,n = 2k. 


(i) “Roses are red, carrots are orange.” This poetic sentiment reveals a new and unpleasant 
feature of English: hidden quantifiers and logical operators. The use of the plural “roses” 
and “carrots” implicitly indicates that these statements are intended to apply to all roses 
and all carrots. Moreover, the comma joining the two phrases has the same meaning as 
AND. Thus we write (Vz, (x € U > R(x)) A (Vy, (y € C > O(y)). In this case, we could get 
away with a single quantifier for both clauses, writing 


Vz, ((2 E€ U = R(z))A (z € C => O(z))). 


But if A had been V in both statements, the two encodings above would not be equivalent. 
The first encoding now says “All roses are red OR all carrots are orange,” which is false (as 
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white roses and purple carrots do exist!). But the second encoding (upon eliminating IF) 
states that every object is not a rose or is red or is not a carrot or is orange, which is true. 


(j) “Each even integer greater than 2 is the sum of two primes.” This sentence (called 
Goldbach’s Conjecture) contains two hidden existential quantifiers signaled by the verb 
“is.” We can encode it as follows: 


Va € Z, (x is even A x > 2) > [Ay € Z, 3z € Z, (y is prime A z is prime A x = y + 2)]. 


As this example illustrates, we often allow ourselves to use restricted quantifiers involving 
a “standard” universe (like Z), but we apply the conversion rules to move the non-standard 
restrictions on the variables (like being even, or being prime) into the propositional part of 
the statement, rather than inventing new universes for the set of even integers larger than 
2 or the set of prime integers. 


Negating Quantified Statements 


We continue our translation examples, letting U be the universe of roses and letting R(x) 
mean “x is red.” The next family of examples reveals the subtleties that arise when words 
with a negating effect are mixed with quantifiers. 


1.40. Example. Encode each sentence as literally as possible, using only unrestricted 
quantifiers. 

(a) All roses are not red. 

b) Not all roses are red. 

c) No roses are red. 

d) There does not exist a red rose. 

e) Red roses do not exist. 

f) Some roses are not red. 

(g) There is a non-red rose. 

Solution. (a) We initially say Vz € U,~R(x), and convert this to Vz, (x € U =>~R(x)). (b) In 
this example, the word “not” applies to everything following it, so we get ~Vx € U, R(x), 
which converts to ~Vz,(a € U = R(x)). When we say “no roses” in part (c), we are 
making a negative statement about all roses, leading first to Vz € U,~R(a) and then to 
Va, (x € U =~R(zx)). (d) becomes ~x, (x € U A R(x)); (e) is encoded in exactly the same 
way, even though the word “exist” appears at the end. (f) says 3x, (x € UA ~R(x)), and (g) 
is encoded in the same way. Intuitively, each of the four statements (a), (c), (d), and (e) is 
saying the same thing. Similarly, statements (b), (f), and (g) say the same thing but are not 
the same as the other four statements. Using restricted quantifiers now, these observations 
mean that Vz € U,~ R(x) is equivalent to ~dx € U, R(x), whereas 3x € U,~ R(x) is 
equivalent to ~Vz € U, R(x). 


( 
( 
( 
( 
( 


The equivalences at the end of the last example hold in general. In other words, for any 
universe U and any open sentence P(x), we have the quantifier negation rules: 


~Jr € U, P(x) iff Va € U, ~P(x) | 


wa € U, P(x) iff da € U,~P(a) |. 


We justify the first rule intuitively, as follows. The left side of this rule is the negation of 
the statement “There is an xo in U making P(xo) true.” How can this existential statement 
fail to hold? The statement fails precisely when every single object xo in U fails to make 
P(a) true. This is exactly the universal statement on the right side of the rule. 
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Similarly, the left side of the second rule asserts the falsehood of the statement, “For 
every zo in U, P(aq) is true.” When does this universal statement fail? It fails when, and 
only when, at least one object xo in U fails to make P(g) true. This is exactly the existential 
statement on the right side of the second rule. 


Negations and Denials 


Given any proposition P, recall that the negation of P is the proposition ~P, which has 
the opposite truth value as P. We say that a proposition Q is a denial of P iff Q is logically 
equivalent to ~P. In general, if P is a complicated statement built from logical operators 
and quantifiers, then the negation ~P can be difficult to work with directly. Thus we need 
to develop techniques for passing from the particular denial ~P of P to another denial that 
does not begin with the negation symbol. It turns out that the most useful denials of P are 
those in which the negation symbol is not applied to any substatement involving a logical 
operator or quantifier symbol. 

How can we find a useful denial of P? In this section and previous ones, we have already 
derived the rules we need to simplify expressions that begin with NOT. These rules are 
summarized in the following table. The tricky aspect of this table is that we usually need 
to apply several rules recursively to convert the negation ~P into the most useful denial in 
which no subexpression begins with NOT. 


| Statement Denial of Statement Symbolic Version of Rule 
A and B (denial of A) or (denial of B) AA B) = (XA) V (~B) 


AorB (denial of A) and (denial of B) | ~(A V B) = (~A) A (~B) 
if A, then B A and (denial of B) ~(A > B)=AA(~B) 
not A A ~(~A)=A 


For all z, P(x) There is x, (denial of P(x)) 
There is x, P(x) | For all z, (denial of P()) 
A iff B A or B, not both 

A or B, not both | A iff B 


Q 


Ya € U, P(x) iff Jx € U, ~P(x) 
da € U, P(x) iff Yx € U, ~P(x) 
As B)=(A@6B) 
A®B)=(ASB) 


Informally, applying a negation operator converts AND to OR, OR to AND, FOR ALL 
to EXISTS, EXISTS to FOR ALL, and in these cases we must continue recursively, denying 
the inputs to these operators. On the other hand, negation converts IFF to XOR and XOR 
to IFF without needing to negate the inputs (see also Exercises 17 and 18). Applying a 
negation to something that already begins with NOT removes the NOT. Finally, the most 
troublesome entry in the table is the denial rule for IF. For emphasis, we repeat the key 
fact here: 


2 2/2 


The denial of an IF-statement is an AND-statement, 
not another IF-statement! 


One way to understand this is to recall that “if A then B” is equivalent to “(not A) or B.” 
Negating the latter expression produces “A and (not B),” as asserted in the table. 


1.41. Example. Find useful denials of each statement. 

(a) 3 is odd or 5 is not prime. 

(b) Ifa<b<c, then a < c. 

(c)a < conly ifa <b < c. 

(d) Not everyone likes math. 

Solution. A denial of (a) is “3 is not odd and 5 is prime.” Denying the IF-statement in (b) 
produces “a < b < c and a > c” (assuming a,b, c are real constants). Statement (c) contains 
two traps. First, the “only if” construction in (c) is equivalent (before negating) to the 
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statement “If a < c, then a < b < c.” Second, the phrase a < b < cis really an abbreviation 
for “a < b and b < c.” Thus, a denial of (c) is “a < c and (a > b or b > c).” It would 
be incorrect to replace the parenthesized expression by a > b > c, which still contains an 
implicit AND. Note also that (for real numbers) “not (a < b)” becomes a > b, as opposed 
toa >b. 

Finally, the negation of (d) is “It is false that not everyone likes math.” A useful denial 
of (d) is “Everyone likes math.” (Astute readers may have noticed that (d) and its denials 
are not really propositions.) 


Rant on Hidden Quantifiers (Optional) 


We have now seen several examples of hidden or implicit quantifiers in English sentences, in 
which universal or existential quantifiers are suggested by grammatical constructions (like 
plural subjects) rather than explicitly designated by phrases like “for all” or “there exists.” 
My personal position is that all mathematical writers should avoid the use of all hidden 
quantifiers in all circumstances. The reason is that one of the most common sources of 
error and confusion among beginners (and experts!) in this subject is forgetting to quantify 
a variable, confusing universal quantifiers with existential quantifiers, changing the meaning 
of a statement by moving a quantifier to a new location, or misinterpreting the meaning or 
scope of a quantifier. The correct use of quantifiers is one of the keys to success in reading 
and writing proofs, and it is one of the hardest skills for newcomers to master. Why, then, 
should we make it even harder by gratuitously omitting or disguising the quantifiers used 
in our own writing? I have tried very hard in this text (and elsewhere) to quantify every 
variable appearing in all proofs and other formal discussions, and I implore you to exert 
similar efforts in your own writing. 

Nevertheless, you must be aware that the majority of mathematical authors do not 
share my view and use hidden quantifiers all over the place. Here is a particularly common 
convention that you must know: in all theorem statements, any unquantified variables are 
understood to be universally quantified. The universe of objects often must be inferred from 
context as well. For example, an algebra text might announce as a theorem the commutative 
law for addition: x +y = y + zx. Filling in quantifiers, the actual theorem being presented is 
Yx € R, Vy € R,x +y = y + x. However, this convention must be used with care, keeping in 
mind all the various grammatical constructions that can serve the purpose of quantification. 
Consider, for instance, the statement: “An even integer n > 2 can be written as the sum 
of two primes p and q.” The article “an” indicates universal quantification of n, whereas 
the verb “can be” indicates existential quantification of p and q. To reduce the chance of 
confusion, and to draw attention to the fact that quantifiers are being used, in this text I 
prefer the symbols V and J (or the standard English phrases “for all” and “there exists”) to 
introduce quantified variables. So I would have written the statement under consideration 
like this: “For all even integers n > 2, there exist primes p and q such that n = p+q.” One 
final grammatical point: note the plural verb form EXIST (not EXISTS) agrees with the 
plural subject PRIMES occurring later in the sentence. 


Exercise: Find all the places in this text where, despite my comments here, I inadvertently 
used a hidden quantifier. 
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Rules for Converting and Negating Quantifiers. 


x eU, P(x 


= |Jx, (x € U^ P(a)) 


Va € U, P(x 


( 
( Yz, (x € U => P(x)) 
( 
( 


~Jr € U, P(x 


) 
J e 
)] & [Vz EV, ~Pla)] 
jp e 


Va E€ U, P(x 


Jx € U, ~P(x) 


Translation Points. (a) In English sentences, changing the relative positions of 
quantifiers and logical keywords (especially negative words like “not” ) can affect 
the meaning. 

(b) In symbolic logic (unlike English), quantifiers must precede the variables they 
modify. 

(c) Quantifiers can occur within other logical constructions (example: “if all roses 
are red, then some violets are blue”). Moving these quantifiers to the front of the 
symbolic version of the statement can change the meaning. 

(d) Quantifiers often occur implicitly in English; for instance, a plural subject 
may signal a universal quantifier, whereas the verb “is” can disguise one or more 
existential quantifiers. 


Finding Useful Denials. Applying a negation operator to a complex statement has 
the following effect on the outermost logical operator: A becomes V, V becomes 
A, V becomes J, J3 becomes V, ~ cancels out, <= becomes @, and @ becomes 
<. In the case of A, V, V, and J, we must continue by recursively denying the 
inputs to these operators. Regarding IF, chant this statement aloud five times: 
THE DENIAL OF A= BIS AA (~B). |The denial rules are summarized in the 


table on page 35. 


Exercises 


1. 


Convert each statement to an equivalent statement using unrestricted quantifiers. 
(a) Vn € Z,n is even @ n is odd. (b) dn € Z,n is odd A n is perfect. 

(c) Yx € R, x > 3 >x +2 > 5. (d) Jq € Q>0, Ir € Qo, 8 +r? = 1. 

(e) Vz ER, An € Za <n. 


Eliminate all propositional operators from the following expressions by converting 
to restricted quantifiers. (a) Jx, x € QA x? = 3. (b) Yx, x € Z > x? € Z>o. 

(c) Yz, Vy, (x € RAy E R) > x+y E R. (d) Yx, [z € Q > (an,n € Z>oArn E Z)]. 
(e) Bk,k E Z An = 2k] > [Am,m€ ZAn+3 = 2m + 1]. 

Let U be the set of people, let T(x) mean x is 20 feet tall, and let P be the 
statement “~v Jx € U,T(x).” Write statements equivalent to P using: (a) an 
unrestricted existential quantifier; (b) a restricted universal quantifier; (c) an 
unrestricted universal quantifier. Give literal English translations of P and your 
three answers. 


Let L be the set of lobsters, let R(x) mean z is red, and let Q be the statement 
“~Yr € L, R(x)” Write statements equivalent to Q using: (a) an unrestricted 
universal quantifier; (b) a restricted existential quantifier; (c) an unrestricted ex- 
istential quantifier. Give literal English translations of Q and your three answers. 


37 


38 
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14. 
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Consider the true sentence “Every human is male or female.” Someone incorrectly 
encodes this sentence as: (Yx € H, M(x)) V (Yx € H, F(x)), where H is the set of 
humans, M(x) means x is male, and F(x) means x is female. (a) Translate this 
encoding back into English (do not use variables or symbols), and explain how 
the meaning differs from the original sentence. (b) Give a correct encoding of the 
original sentence. 


Give a specific example of a universe U and open sentences P(x) and Q(x) such 
that the two statements Jx € U, (P(x)^AQ(x)) and [Ax € U, P(x)]^A [3x € U, Q(x)] 
have different truth values. Explain the difference in meaning between these two 
statements. 


Let H be the set of humans, and let M(x) mean x is mortal. Translate each 
symbolic formula into an English sentence that does not use propositional logic 
keywords such as AND, IF, etc. Which sentences are true? (a) Vx, x € H > M(x). 
(b) Yx, x € H A M(x). (c) 3z,x € H ^A (~M(x)). (d) 3z,x € H => (~M(zx)). 
Find useful denials of each statement. All answers should be English sentences. 
(a) John has brown hair and blue eyes. (b) If £o is even then 32 is odd. (c) All 
prime numbers are odd. (d) Purple cows do not exist. (e) There are deserts with 
red sand. (f) The sine function is continuous or the secant function is not bounded. 


Find useful denials of each statement. All answers should be English sentences. 
(a) Jamestown was founded in 1607 or Williamsburg is not the capital of North 
Dakota. (b) 2+2 = 4 if 3+3 = 7. (c) All rubies are red. (d) Some diamonds are 
blue. (e) Not all elephants are pink. (f) All carrots are orange iff all rabbits are 
white. (g) Every integer is either positive or negative, but not both. (h) For all 
natural numbers zx, if x is prime and even, then x = 2. 


Convert each proposition to the standard form P => Q, and then give a useful 
denial of the statement. (a) G has a proper normal subgroup only if G is not 
simple. (b) A sufficient condition for normality of X is metrizability of X. (c) For 
f to be Lebesgue integrable, it is necessary that f be measurable. (d) Invertibility 
of A implies det(A) 4 0. (e) Whenever no is a power of an odd prime, no has a 
primitive root. (f) Injectivity of g is sufficient for g to have an inverse. 

True or false? Explain. 

(a) Jx € Z, x? = —4. (b) Iz, x € ZA a? = —4. (c) Iz, s E€ Z > r? = —4. 

(d) Yz € Z,£+1 = 1+ z. (e) Vx, x € Z > xz+1 = 1440. (f) Yxz,x € ZAz+1 = 142. 


Encode each English sentence in symbolic form. Only use unrestricted quantifiers 
in your answers. Let B(x) mean x is black, R(x) mean x is a raven, and E(x) 
mean zg is evil. 

Sample Question: All black ravens are evil. Answer: Yx, |(B(x)AR(x)) > E(x)]. 
(a) Some ravens are evil. (b) Non-black ravens exist. (c) Although every raven is 
black, not every raven is evil. (d) Each raven is either black or evil, but not both. 
(e) If some ravens are black, then all ravens are evil. (f) Not all ravens are evil if 
some ravens are not black. 


Write useful denials of each statement in the previous exercise; your answers 
should be English sentences. 


Let M be the set of melons, let V be the set of vegetables, let G(x) mean x is 
green, let O(a) mean z is orange, let T(x) mean z is tasty, and let J(x) mean z is 
juicy. Encode each English sentence in symbolic form; your final answers should 
not use restricted quantifiers. (a) There exists an orange vegetable. (b) Every 
melon is juicy. (c) No green vegetable is tasty. (d) Some melons are green, some 
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are orange, but all are tasty. (e) If some orange vegetable is juicy, then all green 
melons are tasty. (f) All tasty melons are orange. (g) Juiciness is a necessary 
condition for a melon to be tasty. (h) Vegetables are green only if they are not 
juicy. 

Write useful denials of each statement in the previous exercise; your answers 
should be English sentences. 


Find all hidden quantifiers in the following sentences, and rewrite the sentences 
to make all quantifiers explicit. (a) Eagles have wings, but pigs do not. (b) The 
square of a real number is nonnegative. (c) Positive integers are expressible as 
sums of four squares. (d) A cyclic group G can be generated by an element x in 
G. 


Use truth tables to show that all of the following propositional forms are possible 
denials f P Q. (a) Pe(~Q) b) (6P) SQ (c) (WP) 6 (~Q) 


Which of the following propositional forms are correct denials of P 9 Q? Explain. 
(a) (~P) A^ (~Q) (b) (~P)SQ (c) (~P) & (~Q) 

(d) (~P) S (~Q) (e) PE (~Q) (@)~Peg) 

Let F(x) mean z is a fish, let A(x) mean z is an animal, let S(x) mean x can 
swim, let L(x) mean « has lungs, and let W(x) mean z is white. Use these open 
sentences to encode each English sentence in symbolic form. (a) Some fish are 
white. (b) Not all animals have lungs. (c) Every white fish can swim. (d) Some 
animals that can swim are not fish, but all fish can swim. (e) Being a fish is a 
sufficient condition to be able to swim. (f) If no fish have lungs, then some white 
animals cannot swim. 


Write useful denials of each statement in the previous exercise; your answers 
should be English sentences. 


Encode each sentence in symbolic form, using only unrestricted quantifiers. Let 
R(x) mean z is a rabbit, U(x) mean zx is a unicorn, W (x) mean « is white, and 
C(x) mean z is cute. (a) All white rabbits are cute. (b) Some rabbits are not 
cute. (c) All unicorns are white but not cute. (d) There are non-white unicorns. 
(e) Although not every rabbit is white, every unicorn is cute. (f) All rabbits 
are cute only if some unicorns are white. (g) Some white objects are unicorns 
or rabbits, but not both. (h) For all objects, a necessary condition for being a 
unicorn is being white. (i) The non-cuteness of some rabbits is sufficient for the 
whiteness of all unicorns. 


Write useful denials of each statement in the previous exercise; your answers 
should be English sentences. 


Find all the hidden quantifiers in the true-false exercises of §1.3. Consider whether 
this changes any of your answers to those exercises (e.g., if some whales live in 
aquariums rather than in the ocean). 
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1.6 Denial Practice and Uniqueness Statements 


Finding useful denials of complex logical statements is an absolutely fundamental skill 
needed to understand and create mathematical proofs. The only way to master this skill, 
like so many others, is practice! You will practice translating and denying various statements 
in the first half of this section. Afterward, we consider one of the more subtle ideas in 
quantified logic — the concept of uniqueness. 


Practice with Useful Denials 


In the last section, we gave a table summarizing the rules needed to find useful denials of 
logical statements. Review that table now, memorizing all the rules, and noting particularly 
that the denial of an IF-statement is an AND-statement. Next, spend some time working 
out useful denials of the statements in the next example. Solutions and discussion appear in 
the next subsection. For complicated sentences, you might begin by converting the sentence 
(partially or completely) into symbolic form, to see how logical operators nest within the 
sentence. It can also help to rewrite potentially confusing phrases such as “only if” or “it 
is necessary” into standard IF-THEN form. 


1.42. Example: Denial Practice. Write a useful denial of each statement. 
) If a > b, then a? > b?. 
) a > 3 iff —2a < —4. 
c) All carrots are orange. 
d) There is an integer x such that x is odd and z is perfect. 
e) All roses are not red. 
f) Roses are red, yet some violets are not blue. 
g)1+1=2 and 2 x 2 = 4; or 3? = 8. 
h) Some cherries are not red, or all apples are red, but not both. 
i) Some pigs can fly, if water is wet or the sky is blue. 
) For a to be prime, it is necessary that a is odd or a > 2. 
k) If a is prime, then: a is even implies a = 2. 
1) A sufficient condition for the existence of white ravens is the non-existence of black doves. 
m) Jx € Z, Jy € Z, Jz € Z, (x > 0 and y > 0 and z? + y? = 23). 
n) a > b is necessary and sufficient for 6-1 > a-! only if b > 0 and a > 0. 
o) Va, Jy, Yz, ((x < y + z) > (z < zy < y)). 


(a 
(b 
( 
( 
( 
( 
( 
( 
( 
(j 
( 
( 
( 
( 
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Solutions to Denial Exercise 


Do not read any further until you have tried all parts of the preceding example yourself! 
(a) A denial of “if P then Q” is “P and not Q.” So one answer is “a > b and a? < b?.” 


(b) Denying an IF-statement produces an XOR-statement. So one answer is “a > 3 or 
—2a < —4, but not both.” 


(c) Denying a universal statement produces an existential statement. So one answer is “Some 
carrot is not orange.” 


(d) Denials convert 3 to Y and A to V. So one answer is “For all integers x, x is not odd or 
x is not perfect.” 


(e) Answer: “Some rose is red.” In this example, avoid the temptation to declare that a 
denial of “All roses are not red” is “All roses are red.” Observe that both of the latter state- 
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ments are false, so they cannot be denials of each other. The double negation rule can only 
be used to cancel a NOT operator applying to the entire statement, including quantifiers. 


(£) This sentence features an implicit universal quantifier (signaled by the plural subject 
“roses” ), as well as the word YET, which has the same logical meaning as AND. Denying 
the sentence produces “Some rose is not red, or all violets are blue.” 


(g) Notice the semicolon in the original statement, which serves to group the clauses of the 
sentence as shown by the parentheses here: “(1 + 1 = 2 and 2 x 2 = 4) or 3? = 8.” Using 
the denial rules for OR and then AND, we get “(1 +142 or 2x 2 Æ 4), and 3? 48.” To 
avoid parentheses, we could say: 1+ 1 Æ 2 or 2x 2 Æ 4; and 3? 48. 


(h) This sentence combines “Some cherries are not red” and “All apples are red” with the 
exclusive-OR operation. Denying gives “Some cherries are not red iff all apples are red.” 


(i) This sentence is an IF-statement where the hypothesis comes at the end. Denying gives 
“Water is wet or the sky is blue, and all pigs cannot fly.” 


(j) Before denying, the sentence can be rewritten: “If a is prime, then (a is odd or a > 2).” 
After denying, we get “a is prime and a is not odd and a < 2.” 


(k) The conclusion of this IF-statement is another IF-statement. Denying each IF in turn 
produces “a is prime and a is even and a # 2.” 


(1) Before denying, we rewrite the sentence as: “If black doves do not exist, then white ravens 
exist.” Denying gives “Black doves do not exist and white ravens do not exist.” Since both 
new clauses begin with NOT, we can further simplify the denial to: “All doves are not black, 
and all ravens are not white.” 


(m) In statements starting with multiple quantifiers, imagine forming the negation by plac- 
ing the negation symbol ~ at the far left. Now move this symbol to the right past each 
quantifier, flipping quantifiers from 4 to V and vice versa. Finally, the negation symbol acts 
on the ANDs in the middle of the statement, flipping them into ORs. The final denial is 


Va €Z, Vy € Z,Yz E€ Z, (x <0 Vy < 0V r? +y? £ 2°). 


(n) The statement can be recast as “If (a > b iff b>! > a~'), then (b > 0 and a > 0).” 
Denying the statement produces “(a > b iff b7! > a~') and (b < 0 or a < 0).” 


(o) Flipping the initial quantifiers and denying the IF, we get “Ix, Yy, Iz, (£ < y +2) A(z > 
xy V xy > y).” Note that the original formula z < zy < y contains an implicit AND, which 
becomes OR in the denial. 


Uniqueness 


An object is unique iff it is the only one of its kind. We now introduce a modified version 
of the existential quantifier that can be used to assert the existence of a unique object 
satisfying some property. 


1.43. Definition: Uniqueness Symbol. Let U be a set, and let P(x) be an open sentence. 


The statement |3!x € U, P(x)| means that there exists exactly one object xo in U for 


which P(xo) is true. Equivalently, there exists a unique object xo in U making P(xo) true. 
The unrestricted quantifier 3! x, P(x) is defined similarly; this statement means that there 
exists one and only one object x making P(aq) true. The exclamation mark following the 
existential quantifier is called the uniqueness symbol. 


1.44. Example. True or false? 
(a) J!x ER 22 =2. 
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(b) Ala € Zyo,27 =a. 

(c) dia € R,’ =8. 

(d) 3!x € Z, 3x = 5. 

(e) dla € Q, 3x = 5. 


Solution. (a) is false, because there are two real numbers x that satisfy x? = x, namely 
x = 0 and x = 1. On the other hand, (b) is true, since x? = x has one and only one 
solution (namely x = 1) belonging to the given universe Zs. Part (c) is true, since z = 2 
is the unique real solution to x? = 8. (Note that (—2)? = —8.) Part (c) would be false if we 
enlarged the universe from R to C. Part (d) is false because there is no integer x for which 
3x = 5. On the other hand, x = 5/3 is in Q and satisfies this equation, so “Jx € Q, 3x = 5” 
is true. Moreover, x = 5/3 is the only solution to 3x = 5 in Q, so that the stronger existence- 
and-uniqueness statement in (e) is also true. (e) would remain true in the larger universes 
R or C. 


It is possible to eliminate the uniqueness symbol, replacing 4!2, P(x) by an equivalent 
statement using previously introduced logical symbols. This elimination rule gives us insight 
into the precise meaning of uniqueness, and we often need the rule when giving proofs 
of uniqueness. To derive the rule, let us consider the logical encoding of several related 
statements first. 

Step 1. How can we encode the following statement? “There exist at least two objects 
zo making P(xo) true.” A first attempt might be to write 3x, dy, P(x) A P(y). However, 
this does not quite work, because we are allowed to pick the same object for x and for y. 
For instance, we see that dx, dy,r+1=3Ay+1=3 is true by taking x = y = 2. If we 
intend the variables x and y to represent different (distinct) objects, we must explicitly say 
so by saying x Æ y. So the given statement can be encoded as follows: 


Jx, 3y, [(P(x) A P(y)) Ax A yl. 


Step 2. Now consider how to encode “There exists at most one object xo making P(x) 
true.” The key is to recognize that the situation described here (having at most one object 
that works) is the exact logical opposite of the situation in Step 1 (having at least two 
objects that work). Thus we can obtain the answer by denying the statement from Step 1. 
One possible denial looks like this: 


vz, Vy, [~(P(x) A P(y)) Væ = y]. 


We could continue to simplify the denial by replacing ~(P(x) A P(y)) by (~P(x)) V(~P(y)). 
However, another way to proceed is to remember the equivalence A => B = (~A) V B from 
the Theorem on IF. Using this equivalence in reverse, we obtain the following IF-statement 
as a possible denial of the statement in Step 1: 


Va, Vy, (P(x) A P(y)) > z = y]. (1.5) 


Reading this in English, with some words added for emphasis, (1.5) says that “for all objects 
zo and yo, if P(xo) and P(yo) both happen to be true, then it must be the case that x 
actually equals yo.” This statement does have the intended effect of preventing more than 
one object from satisfying the open sentence P(x). 

Although it already follows from the denial rules, let us confirm directly that (1.5) is true 
when P(x) is satisfied by zero objects, or by exactly one object. First suppose no objects 
make P(x) true. Then for any objects xo and yo, P(£o) A P(yo) is false, so the IF-statement 
is true. Thus, (1.5) is true in this situation. Now suppose exactly one object z9 makes P(x) 
true. In this case, when x = zo and y = Zo, the IF-statement says (P(z0)^P(z0)) > 20 = 20, 
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which is true. On the other hand, for all other choices of x and y, the IF-statement has a 
false hypothesis, hence is true. 

Step 3. We are now ready to encode the target statement, “There exists exactly one 
object xo making P(x) true.” The key is to rewrite this assertion using AND, as follows: 
“There exists at least one object x9 making P(zo) true, AND there exists at most one 
object xo making P(2o) true.” The first clause can be handled by an ordinary existential 
quantifier, and we encoded the second clause in Step 2. To summarize, our elimination rule 
for the uniqueness symbol is: 


dla,P(x)| & | (aa, P(x)) A (Va, Vy, (P(x) A P(y)) > z = y]) | 


A similar rule holds for restricted quantifiers. For instance, J! x € R, x? = 8 can be rewritten 
as 


(Ix € R, £? = 8) A (Yz ER, Vy ER, (£? =8Ay? =8) > z =y). 


We can also adapt the reasoning in Steps 1 through 3 to make statements such as “there 
are at least three objects x) making P(xo) true,” “there are at most two objects xo making 
P(ao) true,” “there are exactly two objects xo making P(ao) true,” and so on. We also 
mention that a statement such as, “the solution is unique, if it exists” is asserting the 
existence of at most one solution, hence could be encoded by the formula (1.5). 


1.45. Remark. The English word UNIQUE is a special kind of adjective called an absolute 
(or non-gradable) adjective. Absolute adjectives cannot be modified by adverbs indicating 
the degree to which that adjective applies. For example, although it might be very cold 
today, or somewhat cold, or really cold, or colder than yesterday, we cannot say that an 
object is very unique, somewhat unique, really unique, more unique, or almost unique. An 
object either is unique (the one and only one object in a given universe having a specified 
property), or it is not. 


Section Summary 


1. Denials. Have you fully memorized the table of denial rules on page 35? Keep 
practicing until the rules become second nature. 


2. Uniqueness. 3!x € U, P(x) means there exists exactly one object xp in U making 
P(x) true. We could say “unique” or “one and only one” instead of “exactly 
one” here. We can eliminate the uniqueness symbol (the exclamation mark “!”) 
with this rule: 


lx € U, P(x)| = | (Ex € U, P(x)) A (Va € U, Vy € U, [(P(x) A Ply) > « = y)) | 


lw 


To say there is at most one xo in U making P(xo) true, we write: 


Va € U,Vy € U, (P(x) A Ply) > z =y]. 


Exercises 


1. True or false? Explain. 
(a) d!2eER,27=2. (b) d!2 € Reo, 2? =2. 
(c) dla € Z,x?-32-10=0. (d) a! a € Zyo, £? — 3x — 10 = 0. 
(e) I!x € Z>0,£? +4r+4=0. (f) alae R27? +4r+4=0. 
2. True or false? Explain. 
(a) 3lrEeZ,2<x<4. (b)3lreEeQ,2<xr<4. (o) 3lreEeZ,2<x<4. 
(da) 3!y € Z,5y-7=2. (e)3!lyE€Q,5y-7=2. (f) d!lyeR 5y-7=2. 


44 


3. 


4. 
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Write a useful denial of each statement. (Assume all unquantified letters are 
constants.) 

a) If ab £0, then a £0 and b Æ 0. 

b) All perfect numbers have last digit 6 or 8. 

c) (X is compact) = (X is closed and bounded). 

) When X is compact, X is closed and bounded. 

) 

) 


a 


e) Va €Z,Aye Qer<y<axtl. 
f) If every odd number is prime, then some raven is not black. 

g) Not all elephants are pink. 

h) 2+2 = 4 only if 2 x 3 = 9. 

Write a useful denial of each statement. (Assume all unquantified letters are 
constants.) 

a) If ab = 0, then a = 0 or b = 0. 

b) Some prime numbers p satisfy 10 < p < 20. 

c) X is compact if X is both complete and totally bounded. 

d) X is connected and compact whenever X is a closed interval. 

f) If some carrot is not orange, then every even number is prime. 

g) Mauve pigs do not exist. 

h) A necessary condition for 1 +1 = 2 is that 2 x 5 = 11. 

i) Some violets are blue only if all oranges are green or yellow. 

j) (P ®Q) @ R) A (Q > (RV (~P))). 

Write a useful denial of each statement. 

a) For every positive real number y, there is a real number x with e” = y. 

b) For all real x and z, if tan z = tan z, then x = z. 

c) For all real x and y, y = 3a + 1 iff x = (y — 1)/3. 


Write a useful denial of each statement. 

a) A necessary condition for all roses to be red is that some carrots are white or 
blue. 

b) Ve € Rso, 36 € Ryo, Vx € R, Vy E R, (|x — y| < ô = | sin(x) — sin(y)| < €). 

c) For all functions f, if the continuity of f is sufficient for the differentiability 
of f, then the integrability of f is necessary for the continuity of f. 

d) ((P => Q) @ R) V (Q A (R= (~P))). 

e) For some real x, x > 2 implies x? > 4 but not conversely. 

Write a useful denial of this statement: 0 € H and Va,b € H,a +b € H and 
Vee H,-ce H. 


For each set H, decide whether the statement in the previous exercise or its denial 
is true for H. (a) H = Q. (b) H = R>o. (c) H = Zyo. (d) H = {-1,0,1}. (e) H 
is the set of even integers. 


Write a useful denial of each statement. 

a) Je € G,Vx E€ Gyexzr =£ =2£xe. 

b) Yz € P, Vy € P,Y⁄z E€ Pa<y<2z5>aK<z. 

c) IL € R, Ve € R>0, IN € Z>0,Vn E Z>o,n > N > |an — L| < €. 
d) Jy € R, [Vx € S,y < x) A (Yz E R, y < z > Iu € S, u < z)]. 
Write a useful denial of each statement. 

a) IM E R, Yz € S,Vy € S, d(x,y) < M. 

b) Vy € G, 3z E€ Gyyxz=e=2xy. 

c) Ve € Rao YN €E Zug, In € Z>o,n > N Ad(an, £) <€. 

Let xo be a fixed object. Encode the following sentence using logical symbols: “ao 
is the unique object in U making P(x) true.” 
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Encode the following sentence using logical symbols: “for all but one object xo in 
U, Q(x) is true.” 


Find a useful denial of 3!x € U, P(x) by eliminating the uniqueness symbol and 
applying the denial rules. Give your answer in symbols and then in English. 


Consider this statement: “If 3! a, P(x) is true, then dx, P(x) must be true.” 
a) Is this statement true for all open sentences P(x)? Explain. 

b) Repeat (a) for the converse of the given statement. 

c) Repeat (a) for the contrapositive of the given statement. 


Consider this statement: “If 3x € U, P(x) is true, then 3z, P(x) must be true.” 
a) Is this statement true for all choices of U and P(x)? Explain. 
b) Repeat (a) for the converse of the given statement. 


Consider this statement: “If 3!x € U, P(x) is true, then 4! xz, P(x) must be true.” 
a) Is this statement true for all choices of U and P(x)? Explain. 
b) Repeat (a) for the converse of the given statement. 


Let P(x) be a fixed open sentence. Find symbolic versions of each statement 
below that do not use the uniqueness symbol. 

(a) There exist at least three objects zo making P(xo) true. 

(b) There exist at most two objects x9 making P(g) true. 

(c) There exist exactly two objects xg making P(xo) true. 


Encode the following sentence using logical symbols: “There exist exactly four 
objects xo making P(xo) true.” 
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Proofs 


2.1 Definitions, Axioms, Theorems, and Proofs 


In most branches of science, such as physics or chemistry, we develop and test theories based 
on experiments. Mathematics, on the other hand, is a deductive science, in which we use 
logical rules to derive a large body of truths from a small set of initial statements, called 
axioms. This section introduces the main ingredients in mathematical theories: undefined 
terms, definitions, axioms, theorems, inference rules, and proofs. 


Undefined Terms and Definitions 


In mathematics, we would like to give a precise and rigorous meaning to every word and 
symbol that we use. This is typically done by giving formal definitions for various mathe- 
matical terms. However, all words and symbols are defined by means of other words and 
symbols, so it is not possible to define everything! The solution is to begin our theory with 
a collection of undefined terms, such as “true,” “false,” “for all,” “set,” and “membership 
in a set.” We can often convey an intuitive idea of what an undefined term is supposed to 
mean by giving synonyms for the term or specific examples. For instance, we might say that 
a set is a collection of objects, or we might give examples of sets such as {1, 2,3} or the set 
of all cows. But, these informal explanations are not mathematical definitions. Ultimately, 
the meaning of the undefined terms is captured by the axioms we use to describe their 
properties. 

Most of the words and symbols we use are, in fact, defined terms. For instance, we 
have already defined the precise meaning of words such as AND, OR, and IF by means of 
truth tables that show how these words are related to the undefined concepts of “true” and 
“false.” Typically, a mathematical definition describes a new object, relationship, or symbol 
via an expression consisting of previously defined symbols. Almost every definition contains 
quantifiers and logical keywords. For example, the following IFF-statement can be viewed 
as a definition of the uniqueness symbol: 


gJ!x € UỌ,P({x)| & |r cu, P(2)) A (Va €U,Vy € U, ([P(x2) A Ply] > «= y)|. 


In the next few sections, we will be practicing proofs using four basic concepts that you 
may have already seen informally: even numbers, odd numbers, divisibility, and rational 
numbers. We now give formal definitions of these concepts. 


2.1. Definition of EVEN, ODD, DIVIDES, and RATIONAL. 


(a) For all n, | n is even | means: | dk € Z,n = 2k. 


(b) For all n, | n is odd | means: | dk € Z,n = 2k +1. 
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(c) For all integers a and b, | a divides b| means: | dc € Z, b = ac. 


(d) For all x, | x is rational | means: | Im € Z, an € Z, (n 4O0A 2 =m/n). 


Before continuing, we offer some advice and remarks about definitions. When studying 
any mathematical subject, you must memorize all definitions perfectly as soon as they 
occur. Do so now for the four definitions listed above. Note that each definition consists of the 
defined term (appearing in the box on the left) followed by the definition text (appearing in 
the box on the right). The overall definition asserts the logical equivalence of the defined term 
and the definition text, for all choices of the variables in the defined term. The quantifiers and 
universes appearing on the right side of each definition are a crucial part of the definition; 
do not forget them! For example, when asked what “n is odd” means, the answer “Jk, n = 
2k + 1” is incorrect (the universe is omitted), and the answer “n = 2k + 1” is even worse 
(the quantifier on k is omitted). 

The only unimportant feature of the definitions are the letters used on each side. Different 
letters may be used instead, as long as there is no conflict with previously introduced letters. 
Sometimes we must use a different letter on the right side, if the letters in the original 
definition already have a different meaning. Furthermore, expressions can be substituted 
for the letters on the left side of a definition. These observations are basic, but they are so 
crucial that we give an example. 


2.2. Example. The phrase “m is odd” means Jk € Z,m = 2k + 1. We could also have 
said dz € Z,m = 2z + 1 or dq € Z,m = 2q + 1 as the definition of this phrase. The phrase 
“k is even” means Ju € Z, k = 2u; here, we needed to change the quantified variable from 
k to some other letter (in this case, u) because k already had a different meaning. The 
phrase “5x + 7 is odd” means Jk € Z,5a +7 = 2k +1. The phrase “c divides c?” means 
Ad € Z, = cd (we had to change the quantified variable from c to d here). The phrase 
“m? + n? is rational” means Ja € Z, 3b € Z, (b # 0 A M? +n? = a/b). 


We reiterate that formal mathematical definitions must be meticulously memorized, in- 
cluding all quantifiers and universes. It is fine, and indeed helpful, to have intuitive descrip- 
tions of what a formal definition is saying. For example, the intuition for the definition of 
“rational” is that a rational number is the ratio of two integers. However, this phrase does 
not capture the full content of the definition (the existential quantifiers should be explicit, 
and the denominator must be nonzero). 


Axioms 


In developing a mathematical theory, we would ideally like to derive every true statement as 
a logical consequence of previously known statements. However, as in the case of definitions 
and undefined terms, it is impossible to prove literally every statement in this way. For, 
at the very beginning, we have not derived any true statements, so we have nothing to 
work with. The solution is to begin our theory with a small collection of initial statements, 
called azioms or postulates, that are assumed to be true without proof. Each axiom is a 
proposition that may use quantifiers, logical symbols, undefined terms, and defined terms. 

What are some examples of axioms? We obtain a large supply of axioms by agreeing 
that any instance of a tautology is an axiom. Recall that a tautology is a propositional 
form, such as P V (~P), that is true for all values of the propositional variables appearing 
in the form. An instance of a tautology is any statement obtained from the propositional 
form by replacing all propositional variables by specific propositions. For example, here are 
some instances of the tautology P V (~P): “1+1 = 2 or 1+1 Æ 2;” “All roses are red or not 
all roses are red;” “The sine function is continuous or the sine function is not continuous;” 
“(0 = 1 and some pigs can fly) or it is false that (0 = 1 and some pigs can fly).” All of these 
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statements are axioms. Instances of tautologies are particularly trustworthy axioms, since 
the truth of these statements is virtually forced upon us by the way we have agreed to use 
the logical connectives such as NOT, AND, and OR. 

Every new definition can be viewed as a special kind of axiom, called a definitional 
axiom. For example, we can rewrite the definition of EVEN (given above) as the following 
axiom: 


Yn, [n is even & Jk € Z,n = 2k]. 
On one hand, the forward implication of this IFF-statement lets us eliminate the defined 
term “n is even” and replace it with the definition text “Jk € Z,n = 2k” (where k is 
a new variable). On the other hand, the converse implication lets us go back from the 
definition text to the defined term, when convenient. The definitions of ODD, DIVIDES, 
and RATIONAL can similarly be recast as axioms involving universally quantified IFF- 
statements. For instance: 


Va € Z, Vb € Z, [a divides b & Ac € Z, b = ac]. 


In what follows, we state all new definitions as definitional axioms in this way. 


2.3. Remark. Some mathematical texts present new definitions as IF-statements rather 
than IFF-statements. For instance, a text might say, “We define an integer n to be even 
IF n = 2k for some integer k.” This practice, while very common, is logically wrong! Each 
definition needs to be an “if and only if” statement telling us precisely when the newly 
defined term is to be considered true. 

All mathematical theories share a certain common core of logical axioms (e.g., instances 
of tautologies, and axioms concerning quantifiers and the equality symbol). Each particular 
theory, such as the theory of sets or the theory of integers or the theory of Euclidean geome- 
try, contains additional axioms giving specific properties of the objects under investigation. 
Some of the axioms in the theory of sets will be introduced later when we study set theory. 
In the theory of integers, here is a sample of some of the possible axioms we might start 
with. (We assume Z, +, -, 0, and 1 are undefined terms.) 

2.4. Some Axioms for Z. 

(a) Closure under Addition: Yx € Z,Vy € Z,x +y E Z. 

(b) Commutativity of Addition: Yx € Z,Vy E Z,£z +y =y+r2. 

c) Associativity of Addition: Yx € Z, Yy € Z,Yz € Z, (x +y) +z =x + (y+ z2). 
d) Additive Identity: Vx € Z, x +0 = z. 

) Additive Inverses: Vx € Z, Jy € Z, x +y = 0. 

f) Closure under Multiplication: Yx € Z,Vy € Z,xz -y E Z. 


( 
( 
( 
( 
(g) 
(h) 
( 
( 
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Commutativity of Multiplication: Yx € Z,Vy E€ Z, £ -y =y- x. 

Associativity of Multiplication: Yx € Z, Yy € Z,Vz € Z, (x-y) -z =x- (y; z). 
Multiplicative Identity: Va € Z, x - 1 = x. 

j) Distributive Law: Va € Z, Vy € Z,Vz € Z,x - (y +z) = (x-y) + (z-z). 

k) No Zero Divisors: Vx € Z,Vy E€ Z, (x #0Ay #0)> xz- y FO. 


These axioms describe some of the basic properties of integer addition and integer multi- 
plication. We could list more axioms giving properties of equalities and inequalities among 
integers. In fact, all the axioms we have listed can be proved from even more basic axioms 
that we do not state here. However, for our initial practice with proofs, we assume that all 
of the above statements and similar basic algebraic facts are already known. This includes 


h 
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arithmetical facts such as 2 + 2 = 4, ~(1/2 € Z), etc., as well as properties of inequalities 
such as: 
Va € Z, Yy € Z, Yz € Z,if x < y and z > 0 then xz < yz. 


We also assume that analogous facts about the real number system are already known. 
However, unless otherwise noted, we do not allow ourselves to use any fact that specifically 
concerns the concepts of even numbers, odd numbers, divisibility, and rational numbers. In 
Chapter 8, we give a detailed development of the real number system starting from a list of 
19 axioms. This development makes extensive use of the proof techniques to be presented 
in the coming sections. 


Theorems, Inference Rules, and Proofs 


Starting from the axioms, we use rules of logic to deduce new true statements called the- 
orems. Each axiom is also considered a theorem. The sequence of steps leading from the 
axioms to a given theorem is called a proof of that theorem. An individual step in a proof 
uses an inference rule to combine one or more previous theorems to obtain another theorem. 
Our first inference rule involves the logical keyword IF. 


2.5. Inference Rule for IF. Suppose P = Q and P are already known to be theorems. 
Then we may deduce Q as a new theorem. 


We can informally justify this rule using the truth table for IF, shown here: 


P Q| P>QR 
T T T 
T F F 
F T T 
F F T 


In the situation described in the rule, P is a known theorem, and hence is a true propo- 
sition. Thus we must be in one of the first two rows of the truth table. But P > Q is also 
known to be true, forcing us to be in the first row of the truth table. In this row, Q is a 
true proposition, so we may safely conclude that Q is a theorem. Similar reasoning with the 
truth table for IF establishes the following inference rule. 


2.6. Contrapositive Inference Rule for IF. Suppose P > Q and ~Q are already known 
to be theorems. Then we may deduce ~P as a new theorem. 


We can also justify the Contrapositive Inference Rule as follows. Given that P > Qisa 
known theorem, the logically equivalent statement (~Q) => (~P) must also be true. Given 
that ~Q is also known to be true, we can deduce the truth of ~P by applying the Inference 
Rule for IF to the theorems (~Q) => (~P) and ~Q. 

On the other hand, suppose P = Q and Q are known theorems. Is it always safe to 
deduce P as a new theorem? Consulting the truth table for IF, we see that the answer is 
no! More specifically, in row 3 of the truth table, P > Q and Q are both true, yet P is false. 
The incorrect deduction of P from known theorems P => Q and Q is sometimes called the 
converse error. 

Some inference rules involve quantifiers, as we see in the next rule. This rule formally 
recasts our earlier intuitive description of what a universal quantifier means. 


2.7. Inference Rule for ALL. Suppose Vx € U, P(x) is a known theorem and c € U isa 
known theorem, where c is a variable or expression denoting a particular object. Then we 
may deduce the new theorem P(c). 
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Intuitively, if we already know that property P is true for all objects x in U, and if 
we also know that the expression c represents an object in U, then we can conclude that 
property P is true for the particular object c. 


2.8. Example. Suppose we have already proved the theorem Vz € R, x? > 0, and suppose 
a and b are fixed positive real numbers. Then ya and Vb are also positive real numbers, 
and hence c = /a— vb is in R because R is closed under subtraction. Now we can apply the 
Inference Rule for ALL, replacing the quantified variable x in “x? > 0” by the expression 
c. We deduce the new theorem (ya — Vb)? > 0. Now we manipulate this inequality using 
known algebraic facts. Expanding the square, we get a— 2yav b+b > 0. Rearranging terms, 
we get a+b > 2v'ab and then (a+b)/2 > Vab. This inequality says that the arithmetic mean 
of two positive numbers is greater than or equal to the geometric mean of those numbers. 


We can obtain more inference rules for AND, OR, IFF, etc., by analyzing truth tables. 
Some examples of these rules are considered in the exercises. But these new inference rules 
turn out to be redundant, because they already follow by combining tautology-based axioms 
with the Inference Rule for IF. For instance, consider this inference rule for AND: given 
that P A Q is a known theorem, we can deduce Q as a new theorem. To see why this rule 
is redundant, note that (P A Q) > Q is a tautology and hence an axiom. Suppose that we 
have proved PA Q already. Using the Inference Rule for IF, we see that Q is a new theorem. 
Similarly, the Contrapositive Inference Rule for IF follows from the Inference Rule for IF 
and an appropriate tautology. 


2.9. Remark. In many logic texts, the Inference Rule for IF is called the modus ponens 
rule, abbreviated MP. “Modus ponens” is a Latin phrase meaning “method of affirming.” 
The Contrapositive Inference Rule for IF is called the modus tollens rule (MT), which means 
“method of raising.” The Inference Rule for ALL is called universal instantiation (UI). We 
shall not use these terms. 


More on Mathematical Theories (Optional) 


This optional subsection gives a bit more detail on how logicians use axiom systems and 
inference rules to develop formal mathematical theories. The problem of not being able to 
define the initial words and symbols is dealt with in the following way. First, we invent 
a formal language using a highly restricted set of symbols (such as ~, >, V, =, €, and 
letters for variables and constants). There are concrete rules determining which strings of 
symbols in this language constitute terms and formulas. Certain strings of symbols in the 
language are designated as axioms, and certain inference rules are given. The key is that 
each inference rule can be executed by performing very basic mechanical manipulations on 
strings of symbols, without needing to know what any of the symbols mean. For instance, 
one inference rule (corresponding to the Inference Rule for IF discussed earlier) says: if the 
string of symbols consisting of a formula P followed by the symbol = followed by a formula 
Q is a known theorem, and if the string of symbols P is a known theorem, then the string 
of symbols Q is a new theorem. The inference rule for universal quantifiers says (roughly): 
if one of our known theorems consists of the symbol V, then a variable symbol x, then a 
formula P, and if t is any term, then we get a new theorem by writing the string P with 
all occurrences of the symbol x replaced by the string of symbols t. We see that axioms, 
proofs, and theorems are all syntactic constructions — properties of the formal language 
that make no specific reference to the underlying meaning of the symbols. 

Of course, the only reason we care about proving theorems is because of what they 
mean! Thus, every formal language has an associated semantic component that attempts 
to define the meaning of the terms, formulas, and theorems in the language. The idea is 
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to create an interpretation for the formal theory by introducing a specific universe (set) 
of objects for the variables to range through, and stipulating the meaning (within this 
specific model) of each symbol in the language. For instance, the meaning of the logical 
connectives is defined using truth tables, as before. A given formal theory can have many 
different interpretations, so the meaning of each formula in the theory depends on which 
interpretation is used. In some interpretations, all of the axioms of the theory become true 
statements; such interpretations are called models of the theory. (Actually, this is not quite 
true: some theories are inconsistent, and have no models at all.) 

It can be shown that all theorems of a formal language (not just the axioms) become true 
statements in any model of that theory. A celebrated result of mathematical logic (Gödel’s 
Completeness Theorem) asserts that, conversely, any formula of a theory that is true in all 
models of that theory is provable from the axioms of that theory. This result is absolutely 
amazing, because it means that the apparently mindless symbol-manipulation rules of a 
formal theory are powerful enough to discover all true statements expressible within that 
theory! Put another way, the concrete syntactic machinery of proofs and formal theorems 
delivers abstract semantic results applicable to a wide variety of situations. Many theorems 
of modern mathematical logic have a similar flavor, establishing the equivalence of a syn- 
tactic concept and a semantic concept. For instance, another version of the Completeness 
Theorem states that a theory has no models iff a contradictory formula of the form PA(~P) 
is a formal theorem within that theory. 

One practical benefit of organizing mathematical theories using abstract axiom systems 
is economy of thought. By selecting appropriate general axioms at the outset, we can simul- 
taneously prove facts about many different situations all at once. As an example of this, we 
now describe the axioms used to define an abstract algebraic structure called a group. The 
undefined symbols are G (representing the set of objects in the group), * (representing an 
algebraic operation that combines two elements of G to produce a new element of G), and 
e (representing an identity element for the operation). The axioms are listed next. 


2.10. Group Axioms. 

(a) Closure: Va € G,Vy E G,rxy EG. 

(b) Associativity: Yx € G,Vy E€ G,Vz E€ G, £x (yx z) = (ay) *z. 
(c) Identity: Vx € G,zrxe=x^exr=r. 
( 


d) Inverses: Vx € G, Jy E€ Gz xy =e^yx*xr=e. 


Starting from these axioms, we can prove many theorems about abstract groups. One 
such theorem is the Left Cancellation Law: 


Va € G, Yx € G, Vy E€ G, (axr =axy) > rt=y. 


The advantage of the axiomatic setup is that all theorems proved for abstract groups from 
the group axioms are automatically available for any particular concrete group we may 
happen to be working with. For example, it can be shown that the integers under addition, 
the nonzero real numbers under multiplication, and invertible 3 x 3 matrices under matrix 
multiplication all satisfy the group axioms. Each of these mathematical structures therefore 
has a left cancellation law. Rather than reproving this law (and all the other theorems of 
group theory) each time a new system comes along, we need only verify that the new system 
satisfies the four initial axioms of a group, and then we know that all the theorems are true 
for this system. 
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Elements of Mathematical Theories. A mathematical theory starts with a small 
collection of undefined terms and introduces definitions for new concepts that 
combine logical operators, undefined terms, and previous definitions. Similarly, 
the theory starts with axioms (statements assumed to be true without proof) 
and uses chains of inference rules called proofs to deduce new theorems. When 
learning a new theory, it is critical to memorize definitions, axioms, and theorem 
statements as soon as they arise. 


Definitions of Even, Odd, Divides, Rational. Memorize these definitional ax- 
ioms, which will be the subject of practice proofs in the next few sections. 


a) For all n, |n is even | if | Ik € Z, n = 2k |. 


b) For all n, |n is odd | iff 


kEZ,n=2k+1\. 


c) For all a,b € Z, |a divides b| iff | 3c € Z,b = ac |. 


d) For all x, | x is rational | iff | 3m € Z, dn € Z, (n #0 Az = m/n) |. 
Note that quantifiers and universes are critical features of the definition, whereas 
the letters used are not important. Expressions can be substituted for the letters 
in the defined term, and letters in the definition text must be changed if they 
already have a different meaning. 


Axioms. Every instance of a tautology is an axiom. Every new definition can be 
viewed as an axiom. Some axioms for Z are listed in item 2.4. The closure axioms 
state that the sum or product of any two integers is also an integer. There are 
axioms asserting the commutative, associative, distributive, and identity laws for 
addition and multiplication. Another axiom says that the product of any two 
nonzero integers is always nonzero. 


Inference Rules. The Inference Rule for IF says that if P > Q and P are already 
known theorems, we may deduce Q as a new theorem. The Inference Rule for 
ALL says that if Vz € U, P(x) and c € U are already known theorems, we may 
deduce P(c) as a new theorem. Many more inference rules follow from the rule 
for IF using appropriate tautologies. 


Exercises 


1. 


Rewrite each statement by expanding definitions. For statements with negative 
logic, give a useful denial of the expanded definition. (Do not prove anything 
here.) [Sample Question: u? is not odd. Answer: Vk € Z,u? A 2k + 1.] 

(a) 37 is odd. (b) k? +k is even. (c) m? — 5 is rational. (d) 7 divides c. (e) 3 does 
not divide 5. (f) v2 is not rational. 


Rewrite each statement by expanding definitions. For statements with negative 
logic, give a useful denial of the expanded definition. 

(a) —18 is even. (b) Ê +3k+1 is odd. (c) Vm? + n? is rational. (d) c+ 1 divides 
c? — 1. (e) ab does not divide 8b + c. (f) e +7 is not rational. 


Informally justify each inference rule by analyzing a truth table, imitating our 
discussion of the Inference Rule for IF. 

(a) Given known theorems P > Q and ~Q, deduce the new theorem ~P. 

(b) Given known theorems A and B, deduce the new theorem A A B. 
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4. 


10. 


11. 


12. 


13. 
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c) Given the known theorem A A B, deduce the new theorem B. 
d) Given the known theorem (~Q) => (RA (~R)), deduce the new theorem Q. 


Show that each proposed inference rule is not correct by analyzing truth tables. 
a) Given the known theorem AV B, deduce the new theorem A. 

b) Given known theorems P > Q and ~P, deduce the new theorem ~Q. 

c) Given the known theorem P => Q, deduce the new theorem P > Q. 


Decide, with explanation, whether each proposed inference rule is always valid. 
a) Given the known theorem Q, deduce the new theorem P V Q. 

b) Given the known theorem Q, deduce the new theorem P $ Q. 

c) Given the known theorem P © Q, deduce the new theorem P > Q. 

d) Given known theorems P © Q and ~P, deduce the new theorem ~Q. 

e) Given the known theorem P > (Q => R), deduce the new theorem 
P=Q)=R. 

f) Given the known theorem (P > Q) => R, deduce the new theorem 

P= (Q= R). 

Show that the following inference rule is valid by a truth table analysis: given 
known theorems PV QV R, P > S, Q = S, and R= S, deduce the new theorem 
S. 


(a) Write the contrapositive of the IF-statement in Axiom 2.4(k). (b) Write the 
converse of your answer to part (a). Is this converse true for all x,y € Z? 


Suppose “Vx € Z, 4x is even” is a known theorem, and a and b are fixed positive 
integers. Which of the following conclusions can be drawn from this theorem using 
the Inference Rule for ALL? Explain. (a) 8 is even. (b) 6 is even. (c) —100 is 
even. (d) 0.4 is even (taking x = 1/10). (e) 4b is even. (f) 4(a + 3b) is even. 
(g) 4(a/b) is even. 


Suppose “Vk € Z,k? — k is even” is a known theorem, and a and b are fixed 
positive integers. Which conclusions can be drawn from this theorem using the 
Inference Rule for ALL? Explain. (a) 20 is even. (b) 6 is even. (c) 8 is even. 
(d) —30 is even. (e) —1/4 is even (taking k = 1/2). (£) (a+b)? — (a+b) is even. 
(g) (a/b)? — (a/b) is even. [Sample Answer: (a) CAN be deduced by taking k = 5 
in the known theorem, since k? — k = 25 — 5 = 20] 


Suppose <, =, and the logical operators are undefined concepts in a mathematical 
theory of real numbers. Use these symbols to give formal definitions of: (a) x < y; 
(b) x > y; (c) x > y; (d) z is strictly between x and y. 


Using only arithmetic operators (+, -, <) and logical symbols, give definitions 
of the following concepts, assuming m,n € Z and x € R: (a) m is a perfect 
square (examples: 0, 1, 4, 9, 100); (b) n is a multiple of m; (c) x is a half-integer 
(examples: 5/2, —22.5, 100/2); (d) n is prime (examples: 2, 3, 5, 7, 11, 101). 


Consider this proposed inference rule involving propositional forms P, Q, R, and 
S: “given known theorems P, Q, and R, deduce the new theorem S.” (a) Assume 
(PAQAR) => S is a tautology. Argue informally that the proposed inference rule 
is valid. (b) Now assume that the proposed inference rule always works. Argue 
informally that (PA QA R) => S is a tautology. 


Suppose the words IF and NOT are undefined concepts in a certain logical theory. 
Using only these words, complete the following definitions of other logical key- 
words. (a) | P OR Q | means... (b) | P AND Q | means... (c) | P IFF Q | means... 


(d) | P XOR Q | means... 
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Suppose the symbols A, ~, and J are undefined symbols in a certain logical theory. 
Using only these symbols and parentheses, complete the following definitions of 
other logical symbols. (a) | P V Q | means... (b) | P > Q | means... (c) |P Q 


means... (d) | Yx, P(x) | means... 


Let x,y, a,b be fixed real numbers. Show how to deduce the new theorem 


ax + by)? < (a? +b?) (x£? +y?) from the known theorem Vz € R,0 < 2? using the 


Inference Rule for ALL and algebraic manipulations. 


Which structures (G,x*,e) satisfy all the group axioms? Explain. 


a) G = {-1,0,1}, axb means a + b, e is 0. 

b) G = {—1,1}, axb means a- b, e is 1. 

c) G = Z, a xb is always 4, e is 4. 

d) G = {0,1,2}, ax0 = a = 0 xa for all a in G, axb = 0 for all nonzero a,b € G, 


e is 0. 


e) G = {0,1,2,...}, axb is the maximum of a and b, e is 0. 
f)G=Z,axb=a+b-2, eis 2. 


Assume (G,x*,e) is a group and a,b,c are fixed elements of G such that a xb = 
e =bxc. Use the group axioms and the Inference Rule for ALL to prove a = c. 


In a certain logical theory, suppose we adopt the following axioms for equality: 


(i) Va,a=2; (ii) Va, Vb, Ve, (a=cAb=c)>a=b. 
(a) Let r and s be fixed objects. Prove r = s > s = r, justifying every step using 


axiom (i), axiom (ii), an instance of a tautology, the Inference Rule for IF, or the 
Inference Rule for ALL. [Hint: P > [((PAQ) => R) > (Q > R)] is a tautology. 


(b) Let r, s,t be fixed objects. Prove (r=sAs=t)>r=t. 


Let n be a fixed integer. Use the axioms for Z given in the section to prove 
0-n=0. Which axioms are used in the proof? 


Suppose, instead of using all tautologies, we only allow instances of the following 


three tautologies as axioms for our theory: 


i) P= (Q= P); 

ii) (P > (Q = R)) + (P > Q) > (P = B); 

iii) (~Q) > (~P)) > (P > Q). 

Let A and B be specific, fixed propositions. Using only instances of (i), (ii), and 
iii) and the Inference Rule for >, prove: (a) A > A; (b) (~B) > (B = A). 
Remarkably, it can be shown that all tautologies are derivable in this way from 
i), (ii), and (iii).] 
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2.2 Proving Existence Statements and IF Statements 


Our goal in the next several sections is to learn how to write precise, formal, mathematical 
proofs. The cornerstone of our exposition is the idea that the logical structure of a statement 
dictates the structure of the proof of that statement. We will give a series of rules, called proof 
templates, for automatically constructing the structural outline of a proof of a statement 
based on the main logical operator appearing in that statement. For instance, one of the 
proof templates says that to prove an IF-statement P => Q, we may write a proof that 
starts: “Assume P is true. Prove Q is true.” We can then continue to use proof templates 
recursively to generate a proof of the new goal, Q, aided by the extra information that P 
is true. We introduce proof templates for existential statements and IF-statements in this 
section; other logical operators are covered later. 


Initial Advice on Proofs 


To gain facility at writing proofs, it is crucial to memorize the proof templates, and practice 
using the proof templates. Equally crucial is to memorize all definitions (did I say that 
already?), since a very common step in proofs is to replace a defined concept with its 
definition. You cannot carry out this step if you have not memorized the relevant definition. 

An essential feature of proofs is that every formula, statement, and letter in a proof has 
a logical status that needs to be explicitly indicated with appropriate words and phrases. 
For instance, the logical status of a given statement might be a known result, something to 
be proved, or a temporary assumption. The logical status of a letter in a proof might be a 
fixed (constant) object not chosen by us, a specific (constant) object that we choose, or a 
quantified variable. Here are some examples of sentences in which the logical status words 
have been written in bold: “Assume n is even.” “We must prove n+ 3 is odd.” “We 
know n > 0 orn < 0.” “Let xo be a fixed, but arbitrary object.” The proof templates 
often direct us to write certain sentences that contain logical status words — do not omit 
these words! Note, for instance, that the IF-template mentioned above contains the logical 
status words “assume” and “prove.” In our initial examples of proofs below, we stress the 
use of logical status words by placing them in bold type. 

At any point in a proof, we can replace any statement by an equivalent statement, which 
has the same logical status as the original statement. For instance, suppose A & B is 
already known (in the most common case, this might be a definitional axiom in which a 
new term appearing in A is defined via the statement B). If we have assumed A in a proof, 
then we can continue the proof by saying that we have assumed B. If we already know 
A, then we can continue by saying that we know B. Finally, if we are trying to prove A, 
then we can instead try to prove B. These replacement rules are used constantly in proofs, 
e.g., every time we expand a definition. 

In many places in a proof, we may need to introduce new variables and constants. 
Whenever this occurs, we must take care not to reuse a letter that has already been given a 
meaning. For instance, suppose we have just written a line in a proof that says: “Assume 
n is even and m is even.” To continue the proof, we might try expanding the definition of 
“even” in both places. An incorrect way to do this would be to say: “We have assumed 
dk € Z,n = 2k and Jk € Z,m = 2k.” The error arises because there is no guarantee that 
the same integer k will work for both n and m (indeed, this would only be true if n = m). 
One possible correct way to proceed would be to say: “We have assumed Jk € Z,n = 2k 
and Jj € Z,m = 2j,” provided that k and j have not already been given a meaning. 
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Proving Existence Statements: Proof By Example 


Here is our first official proof template, which tells us that we can prove existential state- 
ments by constructing a specific example of an object that works. 


2.11. Proof Template for Proving Jx € U, P(x). 

Choose a specific object xo that you think satisfies the statement. (This £o may depend 
on constants that have already appeared in the proof.) 

Prove that xo is in the universe U. 

Prove that P(x) is true. 


The proof template does not tell you how to choose the object x 9 — that is up to you! 
Often, the hardest part of applying this template is deciding on an appropriate £o to use. 
This proof method is called proof by construction or proof by example. Let us turn to some 
illustrations of this proof template, which use the definitions given in the previous section. 
Note that the statements we prove in our initial examples are very basic and may seem 
obvious. We have deliberately done this to allow the reader to focus on each new proof 
template and the structural features of proofs, without getting sidetracked by complex 
mathematical content. That will come soon enough! 


2.12. Example. Prove that 52 is even. 

Proof. By definition of “even,” we must prove that Jk € Z,52 = 2k. Choose k = 26. We 
know that 26 is an integer, and 52 = 2 x 26 by arithmetic. 

Comments: The first step in this proof is replacing a defined term by the text of its definition. 
We do this step all the time in proofs. (Did you memorize the definitions?) This produced 
an existence statement that needed to be proved. Here we could immediately see which 
specific k would work, but it was still necessary to point out that k was in the universe Z. 
If you think this is not really necessary, try imitating this proof to prove that 52 is odd — 
what goes wrong? 


2.13. Example. Prove that 0.36 is rational. 

Proof. By definition of “rational,” we must prove Im € Z,4n € Z,n 4 0A 0.36 = m/n. 
Choose m = 36 and n = 100. We know 36 is an integer, 100 is an integer, 100 4 0, and 
0.36 = 36/100. 

Comments: Other choices of m and n also work (for instance, m = 9 and n = 25), but the 
proof template only requires us to find a single valid example. 


2.14. Example. Prove that 7 divides 0. 

Proof. By definition of “divides,” we must prove 3w € Z,0 = 7w. Choose w = 0. Note 
that 0€ Zand 0=7-0. 

Comments: Was it obvious to you before reading the proof that 7 would divide 0? Also 
observe that logical status can be indicated in many ways; here we said “note that” to 
introduce some known arithmetical facts. 


In the next example, and throughout this text, we shall sometimes include commentary 
within a proof indicating what we are thinking and how we are generating the next step of 
the proof. This commentary appears in square brackets, and is not part of the proof itself. 


2.15. Example. Suppose a is a fixed (constant) integer. Prove that 4a? + 11 is odd. 

Proof. By definition of “odd,” we must prove Jk € Z,4a? + 11 = 2k +1. [It is probably 
not evident at this stage what k to choose. So we instead continue by manipulating 4a? +11 
as follows.] By algebra, we know that 4a? + 11 = 4a? + 10 + 1 = 2(2a? +5) +1. Choose 
k = 2a? + 5; the previous equation shows that 4a? + 11 = 2k + 1 is true. Moreover, since 
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a and 2 and 5 are all integers, a? and 2a? and k = 2a? +5 are also integers because Z is 


closed under addition and multiplication. 

Comments: We frequently need the closure properties of Z to see that the objects we choose 
in proofs do belong to the universe Z as needed. Also note that the integer k that we chose 
in the proof was allowed to be an expression involving the previously introduced constant 
a. This is what we mean in the proof template when we say that the chosen object £o is 
allowed to “depend on” previous constants. 


2.16. Example. Let c be a fixed integer. Prove that c divides 5c?. 

Proof. We must prove Im € Z,5c? = cm. Choose m = 5c?, which is an integer since it 
is the product of the integers 5, c, and c. By algebra, cm = c(5c”) = 5c?, as needed. 
Comments: When we expanded the definition of “divides,” we used m (not c) as the quan- 
tified variable since c already had a meaning. As in the last proof, our choice of m in this 
proof is allowed to depend on the previously introduced constant c. 


Direct Proof of an IF-Statement 


Theorems involving IF-statements are ubiquitous in mathematics, and we will develop sev- 
eral different methods for proving IF-statements. The most basic method, called direct proof, 
is presented in the following fundamental proof template. 


2.17. Proof Template for a Direct Proof of P > Q. 
Assume P is true. Prove Q is true. [To prove Q, use the assumed statement P and other 
known facts.] 


This proof template features a new idea — the notion of temporarily assuming a state- 
ment that may or may not actually be true. We can explain informally why the proof 
template is valid by considering the truth table for IF one more time: 


P Q|P=Q 
T T T 
T F F 
F T T 
F F T 


When using the direct proof template, our goal is to show that P = Q is true, but we 
do not yet know that this statement is true. We do know that P, being a proposition, must 
be either true or false. Now, in the event that P is false, we see from the truth table that 
P = Q is automatically true; we do not need to prove anything in this case. So we may 
as well assume we are in the other case, where the hypothesis P is true. If we can then 
deduce that Q must be true, we must be in the first row of the truth table, and in that 
row, P = Q is true. Note carefully that, at the end of the proof, we have not proved that 
Q itself must be true. Our proof of Q relies on the assumption that P is true, which may 
not actually be correct. The final conclusion of the proof, i.e., that P = Q is true, does not 
rely on the temporary assumption that P is true. 

Let us start with an example using defined terms you probably have not seen, to show 
how a proof template can guide you in an unfamiliar setting. 


2.18. Example. Outline the proof of this statement: “If f is continuous, then f is mea- 
surable.” Here, f is a fixed function. Even without knowing the meaning of “continuous” 
or “measurable,” we can still begin the proof by saying: Assume that f is continuous. We 
must prove that f is measurable. To proceed, we would have to know the definitions of 
continuity and measurability. 
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2.19. Example. Let xo be a fixed integer. Prove: if £o is odd, then zo + 7 is even. 

Proof. Assume zo is odd. We will prove 79+7 is even. [How can we continue? Our only real 
option is to start expanding definitions; note how we carry along logical status words.] We 
have assumed there is an integer k with zo = 2k+1. We must prove dm € Z, x9 +7 = 2m. 
From our assumption, we see that £o + 7 = 2k +1 +7 = 2k +8 = 2(k +4). Choose 
m = k+4, which is an integer since k is an integer. Our calculation shows that x9 +7 = 2m, 
as needed. 

Comments: We did not get to choose k, which came from an assumed existential statement. 
But, we did get to use k when we chose m in the existential statement to be proved. In 
general, statements that are already known (or assumed) are treated entirely differently 
from statements that have yet to be proved. This is one reason it is crucial to indicate the 
logical status of every statement in a proof. 


2.20. Example. Let a,b,c be fixed (constant) integers. Prove: if a divides b and b divides 
c, then a divides c. 

Proof. Assume a divides b and 6 divides c. We must prove a divides c. Writing out the 
definition three times, we have assumed that for some integer x, b = ax; and for some 
integer y, c = by; we must prove Jw € Z,c = aw. Combining our assumptions, we 
see that c = by = (ax)y = a(ry). So, choosing w = xy, c = aw holds. Note that w € Z, 
since w is the product of the two integers x and y. 

Comments: When we wrote out the definition of “divides” in three places, note that we 
used three different letters (x, y, and w) for the quantified variable. The original definition 
used c for this variable; but that letter already has a meaning in this setting, so we must 
avoid it. 


The Inference Rule for EXISTS (Optional) 


Recall that the Inference Rule for ALL (sometimes called universal instantiation or UI) 
allows us to pass from a known universal statement Va € U, P(x) to particular instances of 
that statement. Specifically, we are allowed to choose any object c known to be in U and 
deduce that P(c) is a true proposition. We now discuss a related rule, the Inference Rule for 
EXISTS, which is sometimes called existential instantiation or EI. This rule lets us extract 
information from a known ezistential statement. Suppose we know (or have assumed) the 
statement Jx € U, P(x). The EI rule allows us to invent a new constant symbol, say xo, and 
assert that zo € U and P(ao) are true. It is crucial to note that we do not have the freedom 
to make a specific choice of xo (in contrast to the UI rule). In other words, we know that 
Xo exists, but we do not have any control over what it is (other than knowing it is in U and 
makes P(x) true). The symbol zo created by the EI rule must be a new constant, which 
does not already have some other meaning assigned to it. 

In the main text, we tacitly use the EI rule without explicit mention, since the rule 
often confuses beginners. To see how this rule works, let us return to the first few lines 
of the proof in Example 2.19. We initially assumed zg is odd, and stated the new goal 
of proving zo + 7 is even. Invoking the definition of ODD, we see that we have assumed 
an existential statement, namely 4k € Z,xp = 2k + 1. To proceed with the proof, we 
must process the existential quantifier using the Inference Rule for EXISTS (EI). That rule 
provides us with a new constant ko that we cannot choose, but which we know satisfies 
ko € Z and xp = 2ko + 1. Here k (with no subscript) is a quantified variable, whereas ko is 
a fixed, but unknown constant. Technically, rg = 2k + 1 is not a proposition (it is an open 
sentence), but £o = 2ko + 1 is a proposition. The proof given earlier in this section did not 
draw attention to this logical nuance, instead passing directly from the assumption “zo is 
odd” to the existence of a constant integer (called k) satisfying zo = 2k+1. Here is a version 
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of the proof where EI is invoked explicitly, and where subscripts are used throughout to 
distinguish quantified variables from constants. 

Proof. Assume zo is odd; we must prove xo +7 is even. We assumed Jk € Z, xo = 2k +1. By 
EI, let ko be a fixed integer satisfying £o = 2k9 + 1. We must prove dm €E Z, £o + 7 = 2m. 
Choose mo = ko + 4, which is in Z since it is the sum of two integers. Now calculate 
zo +7 = 2ko +1 +7 = 2ko +8 = 2(ko +4) = 2mo. We have now proved Im € Z, xo +7 = 2m, 
so zo +7 is indeed even. 


Formal Justification of Proof by Example (Optional) 


We hope the proof template for proving an existential statement Jz, P(x) is very plausible 
at the intuitive level: if you are able to choose a specific object xo and prove P(x) is 
true, then surely Jx, P(x) is true! However, we can also give a more formal justification 
of this proof technique based on an unexpected connection to the Inference Rule for ALL. 
One way to formulate this rule is to introduce axioms of the form (Vz, P(x)) = P(ao). 
There is one such axiom for each open sentence P(x), variable x, and constant xo. We can 
replace the Inference Rule for ALL by these new axioms, since the Inference Rule for IF 
allows us to combine the axiom (Vz, P(a#)) = P(xo) and a known statement Vx, P(x) to 
deduce the new theorem P(xo). Now recall that every IF-statement is logically equivalent 
to its contrapositive. Apply this comment to the new axiom (Vx, ~ P(x)) = ~P(ao) to 
see that (~~P(ao)) >~Va,~P(x) has the logical status of an axiom. Finally, use denial 
rules to simplify this statement into the form P(x%o9) > 3x, P(x). We can now justify “proof 
by example” as follows: once we have proved P(a) is true, we use the previous formula 
and the Inference Rule for IF to obtain the new theorem Jz, P(x). For simplicity, we gave 
this discussion for unrestricted quantifiers, but the same argument works for restricted 
quantifiers. 


Section Summary 


1. Proof Templates. The logical structure of a statement dictates the structure of 

the proof of that statement. For each logical operator discussed earlier (like >, 
J, etc.), we will give a proof template for proving statements built from that 
operator. Statements containing several operators are proved by applying several 
proof templates recursively. 


2. General Advice for Proofs. Memorize all definitions! Memorize and practice the 
proof templates! Do not forget to introduce every statement, formula, and letter in 
a proof with words indicating their logical status! Do not reuse letters that already 
have a meaning! Do not overuse exclamation marks! You can always replace a 
statement by an equivalent statement, which has the same logical status as the 
original statement. We use this step all the time when expanding definitions. 


3. Proof by Example. To prove 3x € U, P(x): choose a specific object xo (which can 
depend on previously introduced constants); prove xo € U; and prove P(z9). 
We often need closure properties of U to see that the chosen object does belong 
to U. 


4. Direct Proof. To prove P = Q: assume P; prove Q. After writing these two 
lines, we often continue by expanding definitions appearing in P and Q, or using 
another proof template to begin proving Q. 
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11. 


12. 


13. 


14. 


Let a and b be fixed integers. Give careful proofs of each statement, making sure 
to use appropriate logical status words. (a) —53 is odd. (b) v2.25 is rational. 
(c) 1 divides a. (d) 8a — 6b is even. (e) b is rational. 

Let r and s be fixed integers. Prove the following statements. (a) 4298 is even. 
(b) 1.3/5.2 is rational. (c) —s divides s. (d) 6r + 4s — 3 is odd. (e) r — s divides 
r3 — s. 

Let x,y be fixed integers. Prove the following statements. (a) 169 is rational. 
(b) 107 — 1 is odd. (c) x divides zë. (d) If x is even, then 32? is even. (e) If x is 
odd and y is even, then y — x is odd. 

Let m and n be fixed integers, and let x be a fixed real number. Prove: (a) If 
m is odd, then 5m — 3 is even. (b) If 5 divides n, then 5 divides mn. (c) If x is 
rational, then mz + n is rational. (d) m/(m? + n? + 1) is rational. 


Let a,b,c be fixed integers. Prove: (a) If a is even, then 3a + 5 is odd. (b) If a 
divides b and b divides c, then ab divides c?. (c) If a 4 b, then (a + b)/(a — b) is 
rational. 
Let a, b,c, r, s be fixed integers. Prove: if c divides r and c divides s, then c divides 
ar + bs. 
Let y, z be fixed real numbers. Prove: if y and z are both rational, then y + z is 
rational. 


Prove the following existential statements. (a) dm € Z, Jn € Z,30m+7n = 1. 

(b) da € Z, Jy € Z, 13a + 5y = 1. (c) Ja € Z, 5b € Z, 100a + 396 = 1. 

Prove the following statements. (a) Ja € Z>0, 4b € Z>0, a? + b? = 100. (b) Jx € 
Jz € Z, £5 + y5 = 25. (c) Ja € Z, 3b € Z, 3c € Z, Jd € Z,a? +b? + e + 


Let x be a fixed real number. Use facts from algebra to prove: (a) If 3x +7 = 13, 
then x = 2. (b) If x = 2, then z? — 6x? + 11x = 6. (c) If z? + 22 — 15 = 0, then 
x = 3 or x = —5. (d) If x > 3, then z? + 5x — 4 > 20. 


Write the converse of each statement in the previous exercise. Explain informally 
whether each converse is true or false. 


Write just the outline of a proof of each statement (imitate Example 2.18). Do 
not expand unfamiliar definitions. 

a) If a group G has 49 elements, then G is commutative. 

b) If a metric space X is complete and totally bounded, then X is compact. 

c) If the series $7] an converges, then limpo an = 0. 

d) If a function f has a right inverse, then f is surjective. 


Outline the proof of each statement using the templates for IF and J. 
a) If a group G has 700 elements, then Jx € G, x has order 7. 

b) If p is prime and Jk € Z, p = 4k + 1, then Jy € Z, Iz € Z, p = y? + 2°. 

c) If g is a polynomial of odd degree, then for some real number r, g(r) = 0. 


Let m and n be fixed positive integers. Carefully explain the error in the following 
incorrect proof of the statement: “if m is even, then m/n is even.” Proof. Assume 
m is even. We must prove m/n is even. We have assumed there exists an integer k 
with m = 2k. We must prove dp € Z, m/n = 2p. Dividing the assumed equation 
m = 2k by n (which is not zero) produces m/n = (2k)/n = 2(k/n). Hence, 
choosing p = k/n, we do indeed have m/n = 2p. So m/n is even. 
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15. 


16. 


17. 


18. 


19. 


20. 
21. 


22. 


23. 


24. 


25. 
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Let a, b, and c be fixed positive integers. We are trying to prove: “if a divides b and 
a divides c, then a divides b+ c.” Find and correct the error in the proof below. 
Proof. Assume a divides b and a divides c; we must prove a divides b + c. We 
have assumed there is an integer k with b = ak. We have also assumed there is an 
integer k with c = ak. We must prove dm € Z,b+c = am. Choose m = 2k, which 
is an integer. Use the assumptions to compute b + c = ak + ak = a(2k) = am, as 
needed. 


Let n be a fixed integer. Find the error in the following proof that n? +n is even. 
Proof. We must prove there exists k such that n?+n = 2k. Choose k = (n?+n)/2. 
Multiplying both sides by 2, we see that n? +n = 2k does hold. 


Let x be a fixed integer. Criticize the following proof of the statement “if x is 
even then z? — 1 is odd.” Proof. x = 2k, k € Z, x’ — 1 = (2k)? — 1 = 8k? — 1 = 
2(4k3 — 1) +1, 4k? -1 EZ. 

Let c be a fixed integer. Criticize the following proof of the statement “if 10 
divides c + 4, then 10 divides c — 6.” Assume 10 divides c — 6, which means 
Ja € Z,c—6 = 10a. Prove 10 divides c+ 4, which means 4b € Z,c + 4 = 10b. 
Choose b = a + 1, which is in Z by closure. Use algebra to compute 10b = 
10(a + 1) = 10a + 10 = c- 6 + 10 = c + 4. 


Let r and s be fixed real numbers. Find the error in the following proof that if r 
and s are rational and s Æ 0, then r?/s? is rational. Proof. Assume r and s are 
rational and s 4 0. Prove r?/s? is rational. We must prove Im € Z, in € Z,n #0 
and r?/s? = m/n. Choose m = r? and n = s?; note that n 4 0 because s £ 0. 


Give a correct proof of the statement in the previous problem. 


Let a and b be fixed nonzero integers. Find the error in the following proof of the 
statement “if a divides b and b divides a then b = +a.” Proof. Assume a divides 
b and b divides a. Prove b = +a. By assumption, there is an integer c with b = ca 
and a = cb. By substitution, a = cb = c(ca) = c?a. Since a Æ 0, we can divide by 
a to get c? = 1. Then c = 1 or c = —1, hence b = a or b = —a. 


Define a rational number r to be dyadic iff Im € Z,Ik € Zso,r = m/2*. Let 
x and y be fixed rational numbers. Prove: (a) If x and y are dyadic, then xy is 
dyadic. (b) If x and y are dyadic, then x + y is dyadic. 


Use the group axioms in §2.1 to prove this theorem about fixed elements a, b,c 
in a group (G,x,e): ifaxb=axc, then b = c. 


Define an integer m to be bisquare iff Ja € Z,Ib € Z,m = a? + b?. Let x,y be 
fixed integers. Prove: (a) 13 is bisquare. (b) 200 is bisquare. (c) 610 is bisquare. 
(d) x? is bisquare. (e) If x is bisquare, then xy? is bisquare. (f) Not every integer 
is bisquare. 


Let r and s be fixed integers. Prove: if r and s are bisquare, then rs is bisquare. 
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2.3 Contrapositive Proofs and IFF Proofs 


Recall our direct proof template for proving P > Q: “Assume P. Prove Q.” Let us see 
what happens when we use this template in the following example. 


2.21. Example. Let x be a fixed integer. Prove: if x? is even, then z is even. 

Proof. Assume x? is even; prove x is even. We have assumed that for some integer k, 
x? = 2k. We must prove Im € Z, x = 2m. Taking the square root of the known equation 
x? = 2k, we deduce 


x = V2k = /4(k/2) = 2./k/2. 
So choosing m = \/k/2, we have x = 2m, as needed. 


This proof has a minor flaw and a major flaw — can you spot them? The minor flaw 
is that x might be negative, but 2\/k/2 is always nonnegative. This can be corrected by 
using the negative of the square root when x < 0. The major flaw is that the proof did 
not check that m was in Z, as required by the definition. In general, dividing an integer k 
by 2 and then taking a square root produces a real number that may not be an integer. 
There is no straightforward way to correct this error. We need a new approach, called the 
contrapositive proof method. 


Contrapositive Proofs 


Recall that to prove any statement A, we can instead prove a new statement B that is 
known to be equivalent to A. In particular, taking A to be the IF-statement P > Q, we 
know that the contrapositive IF-statement (~Q) => (~P) is equivalent to A. Using the 
direct proof template to prove this new statement, we arrive at the following new proof 
template for proving the original IF-statement, called proof by contrapositive. 


2.22. Contrapositive Proof Template to Prove P > Q. 
Assume ~Q. Prove ~P. 
[We often replace ~Q and ~P by useful denials of Q and P, respectively. These are found 
using the denial rules.] 

We can repair the direct proof in Example 2.21 by doing a contrapositive proof instead. 
For this new proof, and throughout our initial discussion of proofs, we assume that the 
following fact about odd and even numbers is already known: 


Vn € Z,n is not even > n is odd. (2.1) 


(Though you are probably aware of this fact, it is not so easy to prove! We provide a proof 
of this fact later.) Since (~P) © Q is equivalent to P = (~Q), the fact also says that for 
all integers n, n is even iff n is not odd. Since (~P) © Q is also equivalent to P 6 Q, yet 
another way to rephrase the fact is to say that every integer is even or odd, but not both. 
With this fact in hand, let us return to our initial example. 


2.23. Example. Let x be a fixed integer. Prove: if x? is even, then z is even. 

Proof by Contrapositive. Assume x is not even. Prove x? is not even. Using the known 
fact, we have assumed z is odd, which means there is an integer k with « = 2k + 1. 
Similarly, we must prove z? is odd, which means Im € Z, x? = 2m + 1. Squaring the 
known equation x = 2k + 1 gives x? = 4k? + 4k +1 = 2(2k? + 2k) + 1. So choosing 
m = 2k? + 2k, which is an integer by closure properties of Z, we have x? = 2m + 1 as 
needed. 
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Comments: We can see retroactively why the contrapositive proof works and the direct proof 
fails. Namely, proving the contrapositive allows us to pass from assumed information about 
x to needed information about x? by squaring an equation. This step succeeds because Z is 
closed under addition and multiplication. On the other hand, in the direct proof we tried 
to go from assumptions about x? to needed information about x by taking square roots. 
Here we did not have closure properties to guarantee that the integer m (chosen at the end 
of the proof) must be an integer. 


2.24. Example. Let x and y be fixed real numbers. Prove: if 2?y + vy? < 30, then z < 2 
or y < 3. Proof by Contrapositive. Assume x > 2 and y > 3. Prove z?y + ry? > 30. To 
proceed, we need to use two known theorems from algebra regarding inequalities: 


Va,b,c,d E R, (a> b^Ac>d)=>a+c>b+d; (2.2) 


Va, b,c,d E R, (a> b> 0^c>d>0)=> ac> bd>0. (2.3) 


Taking a = 2, b = 2, c = y, and d = 3 in the first theorem, we deduce that z +y > 2+3, 
i.e., x +y > 5. [We have just used the Inference Rules for ALL and IF.] Using the same 
values in the second theorem, we deduce that zy > 2-3 = 6. Now, since r+y > 5>0 
and zy > 6 > 0, we can use the second theorem again to deduce that (x + y)xy > 30. By 
algebra, we know (x + y)ry = xz?°y + xy”. So x?y + xy? > 30, completing the proof. 
Comments: How did we know to try a contrapositive proof here, rather than a direct proof? 
Perhaps we started a direct proof by saying: “Assume x?y+ay? < 30. Prove x < 2 or y < 3.” 
At this point, we probably got stuck because it was unclear how to move forward from the 
rather complicated assumption. Doing a contrapositive proof instead provides the much 
more tractable assumption “x > 2 and y > 3.” 


2.25. Example. Let x and y be fixed integers. Prove: if xy is irrational, then «x is irrational 
or y is irrational. 

Proof. [The given statement has a lot of negative logic in it (note “irrational” means 
rational”), so we try simplifying it with a contrapositive proof. Denying the conclusion, we 
begin with the line:] Assume z is rational AND y is rational. [Denying the hypothesis, 
we continue by saying:] Prove xy is rational. [We continue by expanding definitions:] We 
have assumed x = m/n and y = p/q for some integers m,n, p,q with n and q nonzero. We 
must prove Ja € Z,4b € Z,b Æ 0A (xy = a/b). Multiplying together our assumptions, 
we deduce that xy = (mp)/(nq). Choose a = mp and b = nq; note a and b are integers 
by closure of Z, and b Æ 0 since b is the product of two nonzero integers. Also xy = a/b by 
the above calculation. [We have proved our goal, so the proof is now done.] 


“not 


Proving an AND Statement 


Let us take a break from the logical subtleties of IF-statements to introduce one of the more 
straightforward proof templates. This template tells us how to prove an AND-statement. 


2.26. Proof Template to Prove PAQ. 
Part 1. Prove P. 
Part 2. Prove Q. 


I hope this proof template is fully believable to you at the intuitive level. But we can 
still justify it by appealing to the truth table defining ^, as follows. Consulting the truth 
table, we see that the composite statement P ^ Q is true if and only if the two separate 
statements P (on the one hand) and Q (on the other hand) are both true. Thus to prove 
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the composite statement, it is enough to prove P by itself and also to prove Q by itself. 
These proofs often proceed by recursively using more proof templates. More generally, to 
prove an AND-statement of the form Pı A P2 A++- A Pa, we must carry out n independent 
subproofs: prove P;; next, prove P»; ...; finally, prove Pn. 


2.27. Example. Let b be a fixed integer. Prove 1 divides b and b divides b. 

Proof. First, we prove 1 divides b. By definition, we must prove dc € Z,b = 1c. Choose 
c = b; note c is an integer and 1c = 1b = b. Second, we prove b divides b. We must prove 
dd € Z,b = bd. Choose d = 1; this is an integer, and bd = b1 = b. 


2.28. Example. Let x and y be fixed real numbers. Prove: if x < y then x < (a+ y)/2 < y. 
Proof. Assume x < y. Prove x < (x + y)/2 < y. The goal contains a hidden AND; so we 
must prove x < (x + y)/2 AND (xz +y)/2 < y. 

Part 1. Prove x < (x+y)/2. Adding zx to both sides of the assumption x < y, we deduce 
2x < x + y. Multiplying this by the positive number 1/2, we get x < (x + y)/2. 

Part 2. Prove (x+y)/2 < y. Adding y to both sides of the assumption z < y, we deduce 
x+y < 2y. Multiplying this by the positive number 1/2, we get (x + y)/2 < y. 


Proving IFF Statements 


Our next proof template tells us how to prove an “if and only if” statement P = Q. The 
template arises by recalling that P = Q is equivalent to (P > Q) A^ (Q => P). We already 
know how to prove this new statement, using the templates for A and =. Writing this out, 
we get the following. 


2.29. Proof Template for Proving P< Q. 
Part 1. Prove P => Q (by any method). 
Part 2. Prove Q => P (by any method). 


We should make a few comments about this template. The proof method is called a 
two-part proof of the IFF-statement, since the proof requires us to prove two separate IF- 
statements. Part 1 is often called the forward direction or the direct implication. Part 2 
is often called the backward direction or the converse implication. If we are proving “P is 
necessary and sufficient for Q,” part 1 can be called the proof of sufficiency, and part 2 can 
be called the proof of necessity. 

Beginners sometimes forget to write part 2 by the time they finish writing part 1 of the 
proof. Thus, they have omitted half of the proof! To avoid this, I recommend setting up the 
entire proof structure shown above, by writing “Part 1. Prove P > Q,” leaving lots of space, 
writing “Part 2. Prove Q => P,” and then filling in both parts. Another remark is that there 
are multiple ways of completing part 1 — we could use direct proof, contrapositive proof, 
or proof by contradiction (to be discussed later). Independently of part 1, we have similar 
choices for how to prove part 2. The natural choice is to try a direct proof for each part. 
Often, however, one direction of the proof may be easier using a contrapositive proof (or 
perhaps a contradiction proof). Remember that these options are available. One final point: 
any temporary assumptions made in part 1 (e.g., “assume P”) are no longer available when 
we reach part 2. Similarly, things proved in part 1 using the assumptions in part 1 can no 
longer be used in part 2 (unless we can reprove them using the assumptions in part 2). On 
the other hand, variables and constants introduced in part 1 of the proof are allowed to be 
reused (with fresh meanings) in part 2 — although it may be preferable to use new letters 
instead to avoid confusion. 


2.30. Example. Let x be a fixed integer. Prove x is even iff x? is even. 
Proof. Part 1. Assume z is even; prove x” is even. We have assumed there is an integer 
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k with z = 2k. We must prove Im € Z,x? = 2m. Squaring the assumption gives 
x? = 4k? = 2(2k?). Choose m = 2k?; then m € Z and x? = 2m, as needed. 

Part 2. We must prove: “if x? is even, then g is even.” We did this earlier in the section 
(using a contrapositive proof), so we do not repeat the proof here. 


2.31. Example. Let r be a fixed real number. Prove r is rational iff —r is rational. 
Proof. Part 1. Assume r is rational. Prove —r is rational. We have assumed there are 
integers p,q with q Æ 0 and r = p/q. We must prove 4m,n € Z,n 4 0A —r = m/n. Since 
—r = (—p)/q, we can choose m = —p and n = q Æ 0; these are integers with —r = m/n. 
Part 2. Assume -~r is rational. Prove r is rational. We have assumed there are integers 
p,q with q # 0 and —r = p/q. We must prove Jm,n € Zn # 0Ar = m/n. Since 
r = —(—r) = (—p)/q, we can choose m = —p and n = q # 0; these are integers with 
r=m/n. 

Comments: Part 2 is totally separate from part 1, so we are allowed to reuse the letters 
p,q,m,n in part 2 (which no longer have the meanings they did in part 1). Despite the 
extreme similarity of the two parts of this proof, both parts are needed to have a complete 
proof of the IFF-statement. On the other hand, we will see later that part 1 of the proof 
really proved a universal statement: Vr € R,(r € Q = —r € Q). This leads to a shorter 
proof of part 2 that relies on the result proved in part 1. Using the Inference Rule for ALL, 
we can replace r by —r in the universal statement written above, to see that (—r) € Q > 
—(—r) € Q. Since —(—r) = r by algebra, we get (-r) E€ Q =r € Q, which proves part 2 
without expanding any definitions. 


2.32. Example. Outline the proof of this statement: 
“if X C R*, then: X is compact iff X is closed and bounded.” 
Solution. We can use the proof templates to generate the structural skeleton of the proof 
without knowing the definitions of “compact,” “closed,” and “bounded.” Here is the proof 
outline: 
Assume X C R*. Prove X is compact iff X is closed and bounded. 
Part 1. Assume X is compact. 

Part 1a. Prove X is closed. (...) 

Part 1b. Prove X is bounded. (...) 

Part 2. Assume X is closed and bounded. Prove X is compact. (...) 
Comments: The main logical operator in the given statement is IF, so we generate the first 
line of the solution using the direct proof template for IF. This reduces us to proving an 
IFF-statement, which requires a two-part proof. We use the direct proof template two more 
times (along with the template for AND in part 1) to finish the proof outline. We write 
(...) for omitted portions of the proof, which would require us to know the definitions of the 
terms involved. Note that the assumption X C R* is available throughout the entire proof, 
whereas the assumption that X is compact can be used only in part 1. Other outlines are 
possible. For instance, we could use a contrapositive proof in part 2 and write “Assume X 
is not compact; prove X is not closed or not bounded.” This new goal could be expanded 
further using the OR-proof template, given later. 


Formally Justifying the Proof Templates (Optional) 


We have given intuitive justifications for the new proof templates in this section based on 
truth tables. We can give a completely rigorous derivation of these templates based on 
certain tautologies and the Inference Rule for IF. For example, consider the template for 
proving the AND-statement P ^Q A R. Suppose we can prove P by itself, Q by itself, and R 
by itself. Note that (P > (Q > (R = (P AQ ^ R)))) is an axiom, because it is a tautology 
(as one may check with a truth table). Applying the Inference Rule for IF to this tautology 
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and the known statement P, we get (Q > (R > (PAQA R))). Applying the Inference 
Rule for IF to this new statement and the known statement Q, we get (R > (PAQA R)). 
Finally, applying the Inference Rule for IF to this statement and the known statement R, 
we get the sought-for theorem P ^A QA R. This chain of deductions, using the Inference 
Rule for IF three times in a row, is the same for any AND-statement P A Q A^ R. The proof 
template for AND lets us condense this argument by just proving P, Q, and R separately. 

Next, consider the contrapositive proof template for proving P => Q. Assume the direct 
proof template for IF-statements is already available. Then assuming ~Q and proving ~P 
proves the new theorem (~Q) => (~P). Using this theorem and the tautology 


(~Q) => (oP) => (P => Q), 


the Inference Rule for IF gives the new theorem P = Q. The proof template for IFF can 
be justified similarly (see Exercise 24). 


Section Summary 


1. Contrapositive Proof Template. To prove P = Q via a contrapositive proof: 
Assume ~Q. Prove ~P. 


2. AND Proof Template. To prove Pi A P2 A+++ A Pr: 
Part 1. Prove P}. 
Part 2. Prove P>. 


Part n. Prove P,. 


3. IFF Proof Template. To prove P & Q: 
Part 1. Prove P => Q (by any method). 
Part 2. Prove Q => P (by any method). 


4. Points of Advice. (a) Memorize the new proof templates! (b) Given a complex 
statement built up using multiple logical operators, you need to combine multi- 
ple proof templates to generate the structure of the proof. (c) When do we use 
contrapositive proofs? Try this method when a direct proof fails, or when the 
original statement contains negative logic, or when the hypothesis of the given 
IF-statement seems unwieldy. (d) When proving an IFF-statement, do not forget 
that both implications must be proved! Also, the two parts of the proof are com- 
pletely independent, so that assumptions made in part 1 cannot be used in part 
2, and vice versa. But letters used in part 1 can be reused in part 2, with new 
meanings. 


Exercises 


1. Use the proof templates studied so far to write just the structural outline of a 
proof of each statement. Do not expand definitions of unfamiliar words. 
(a) If G is a cyclic group, then G is commutative. [use contrapositive proof] 
(b) f is a bijection iff f is injective and surjective. [use direct proof for both parts] 
(c) If f is continuous or monotonic, then f is integrable. [use contrapositive proof] 
2. Let a,b,c be fixed integers. Write a contrapositive proof of each statement. 
(a) If ab is even, then a is even or b is even. 
(b) If a does not divide bc, then a does not divide c. 


3. Let x and y be fixed real numbers. Prove: if x — y is irrational, then zx is irrational 
or y is irrational. 
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14. 


15. 


16. 


17. 
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Let m,n be fixed integers, and let x be a fixed real number. Prove each IFF- 
statement. 

a) 2x +3 > 11 iff x > 4. 

b) n is odd if and only if —3n is odd. 

c) n divides m iff n divides —m. 


Let x be a fixed real number. Prove: 

a) 3x +7 = 5x — 4 iff x = 11/2; 

b) (x +3)? = (z — 1)? iff x = —1. 

Let x be a fixed real number. Prove: 

a) 9x +4 < 2a — 10 iff a < —2; 

b)2<a<5iff-6<4-2¢ <0; 

c) (x + 2)? > (x +5)? iff x < —3.5. 

Let x be a fixed real number. Prove: if x5 + 2x3 +x < 50, then x < 2. 


Let x and y be fixed real numbers. Prove: if x < y then x < (2x + y)/3 < 
(x + 2y)/3 < y. 

Let x be a fixed integer. Prove that x is odd iff x? is odd: (a) using (2.1) and the 
definitions; (b) using (2.1) and the theorem proved in Example 2.30. 


Let c be a fixed integer. Prove da € Z,c = 10a + 2 iff 3b € Z,c = 5b — 3 and bis 
odd. 


Let a, j, r,s be fixed integers. Prove: if j divides r + s, then: a = bj + r for some 
integer b iff a = cj — s for some integer c. 


Let a,b,d,q,r be fixed integers. Using only the definitions, prove: if b = aq +r, 
then: (d divides a and d divides r) iff (d divides a and d divides b). 


(a) Informally justify the contrapositive proof template for proving P => Q by 
analyzing the truth table for P > Q. (b) Informally justify the proof template 
for proving P = Q by analyzing the truth table for P = Q. 


Write a proof outline for each statement based on proof templates studied so far. 
a) (PeQ)=> (S T). 

b) (Ax, P(x)) © (QA Ay, R(y)). 

c) dz € U,[P(x) > (~Q(x) V R(x))]. 

Find the errors (if any) in each proof outline below. 

a) Statement: If ab € P, then a € P or b € P. Proof Outline. We use a contra- 
positive proof. Assume a ¢ P or b ¢ P. Prove ab ¢ P. (...) 

b) Statement: F is closed iff X — F is open. Proof Outline. Part 1. Assume F is 
closed. Prove X — F is open. (...) Part 2. Assume F is not closed. Prove X — F 
is not open. (...) 

c) Statement: A is invertible iff det(A) 4 0. Proof Outline. Part 1. Assume A 
is invertible. Prove det(A) # 0. (...) Part 2. Assume det(A) = 0. Prove A is not 
invertible. (...) 


Identify and correct all errors in the following proof that x is odd iff 3x +7 is even 
(where « is a fixed integer). Proof. Assume x is odd, which means x = 2k + 1. 
We must prove 3x + 7 is even, which means 3x + 7 = 2k for some integer k. By 
algebra, we know 3a + 7 = 3(2k + 1) +7 = 6k + 8 = 2(3k + 4). So we can choose 
k = 3k + 4. 


(a) For fixed positive real numbers x and y, prove: if x Æ y then 2? 4 y?. 
(b) State the converse of part (a). Is the converse always true? 
(c) Is Yz, y € R, £ = y & x? = y? true or false? Explain. 
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18. 


19. 


20. 
21. 


22. 


23. 


24. 


Prove (2.2) and (2.3) using only the ordering axioms for the set of real numbers 
(see axioms O1 through O6 in §8.1). 


Use (2.3) (and the analogous fact for >) to prove: for any fixed z,y E€ R>o, £ < y 
iff 2? < y?. 
Let r and s be fixed positive real numbers. Prove: if r < s, then r < yrs < s. 


Suppose U = {21, z2,...,2n} is a finite set. Create a proof template for proving 
Va € U, P(x) by converting this quantified statement to an AND-statement. 


Create and justify two proof templates for proving P V Q by replacing this OR- 
statement by an equivalent IF-statement. 


Create and justify a proof template for proving P 6 Q by replacing this XOR- 
statement by an equivalent IFF-statement. 


Carefully justify the proof template for IFF using the Inference Rule for IF and 
an appropriate tautology. 
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2.4 Proofs by Contradiction and Proofs of OR-Statements 


This section introduces a powerful general technique for proving statements called proof by 
contradiction. After studying some proofs using this technique, we introduce some proof 
templates for proving XOR-statements and OR-statements. We will see that previous proof 
templates for proving IF-statements (the direct proof and contrapositive proof methods) can 
be viewed as special cases of the OR-proof templates. This should not be too surprising, 
since we know an IF-statement can always be converted to an equivalent OR-statement. 


Template for Proof by Contradiction 


The method of proof by contradiction is a versatile, but sometimes confusing, technique 
for proving many different kinds of statements. The intuition for this proof method is as 
follows. Suppose we are trying to prove that some statement P is true. One way to proceed 
is to assume that P is false, and see what goes wrong. More specifically, if the assumption 
that P is false leads to a contradiction, then P must be true after all. These ideas suggest 
the following proof template. 


2.33. Proof Template to Prove Any Statement P by Contradiction. 
Assume, to get a contradiction, ~P. 

Use this assumption and other known facts to deduce a contradiction C. 
Conclude that P must be true after all. 


2.34. Remarks on the Contradiction Proof Template. 


(a) As always, the logical status word “assume” in Step 1 is mandatory! The next phrase 
in Step 1, “to get a contradiction,” while not absolutely mandatory, is highly recommended. 
The reader of your proof is likely to get confused if you suddenly assume a false statement 
~P, which is the very opposite of the statement to be proved. You can allay this confusion 
by stating at the outset that the sole purpose of this assumption is to reach a contradiction. 
Similarly, upon reaching a contradiction at the end of the proof, it is good to finish with a 
line such as: “This contradiction shows that the original statement P is true.” 


(b) In Step 1, we almost always replace the assumption ~P with a useful denial of P, found 
using the denial rules. 


(c) Step 2 of the proof template tells you to “deduce a contradiction,” but it does not 
tell you exactly what contradiction you are aiming for. This makes proofs by contradiction 
harder to finish compared to other types of proofs, since you do not have a specific goal to 
reach. You must explore the consequences of the false assumption ~P, combining it with 
other known facts to see what happens. 


(d) Remember that a contradiction is a propositional form that evaluates to false in every 
row of its truth table; a typical example is PA(~P). To be perfectly precise, Step 2 is really 
deducing an instance of a contradiction, which is a proposition obtained by replacing every 
propositional variable in the contradiction by specific statements. 


Before looking at examples of proof by contradiction, we give a technical justification 
for why the proof method works. Suppose we succeed in completing Steps 1 and 2 of 
the contradiction proof template. Then, according to the direct proof template discussed 
earlier, we have really proved the statement (~P) = C. Now, recall from Example 1.31 
that ((~P) = C) => P is a tautology and hence an axiom. Since we have also just proved 
(~P) = C, it follows from the Inference Rule for IF that P is a theorem, as needed. 
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Examples of Proof by Contradiction 


We now give examples of the method of proof by contradiction. 


2.35. Example. Prove that no integer is both even and odd. 

Proof. We must prove: ~dz € Z,x is even/ x is odd. Assume, to get a contradiction, 
that there does exist x € Z such that x is even and æ is odd. This assumption means 
that for some integer k, x = 2k, and for some integer m, x = 2m + 1. Combining these 
assumptions, we see that 2k = 2m + 1, hence 2(k — m) = 1, sok —m = 1/2. On one 
hand, we know 1/2 is not an integer. On the other hand, since k,m € Z, 1/2 =k—m 
is an integer by closure properties of Z. We have reached the contradiction “1/2 is not an 
integer and 1/2 is an integer.” Thus, our assumption is wrong, meaning that no integer is 
both even and odd. 

Comments: The given statement P begins with NOT, so we began the proof by using the 
double negation rule to deny P. We then expanded definitions until discovering a contra- 
diction. 


2.36. Example. Prove: for all positive integers a and b, if a divides b then a < b. 

Proof. Assume, to get a contradiction, that there exist positive integers a,b such that 
a divides b and a > b. We know there is an integer c with b = ac. Since a and b are both 
positive, we see that c = b/a must also be positive. As c is an integer, c > 1 follows. But 
now, by a known theorem (see (2.3) in $2.3), we deduce 


b=ac>a:l=a>b. 


We have reached the contradiction b > b, so the original statement is true. 
Comments: To see the contradiction more explicitly, note that b > b is the negation of b < b. 
Since b < b is a known fact, the proof has derived the contradiction “(b < b)A ~(b < 6).” 


2.37. Example. Prove: for all x,y € R, if x is rational and y is irrational, then x + y is 
irrational. 
Proof. Assume, to get a contradiction, that there exist real numbers x, y such that x is 
rational and y is not rational and x+y is rational. Expanding definitions, our assumption 
means that x = m/n and x + y = p/q for some integers m,n,p,q with n 4 0 4 q. By 
algebra, we deduce that 

y= (+9) ~#= (p/d) ~ (mm) =, 
where pn— qm E Z, qn € Z, and qn 4 0. Comparing to the definition of “y is rational,” this 
calculation proves that y is rational. We now have the contradiction “y is rational and y 
is not rational.” So the original statement must be true. 


Next we prove a famous theorem using proof by contradiction. We need one fact in the 
proof that we have not yet justified from basic principles: any fraction m/n with m,n € Z 
and n Æ 0 can be written in lowest terms, meaning that no integer larger than 1 divides 
both m and n. In particular, this means that m and n can be chosen so that m and n are 
not both even (divisible by 2). For one proof of this fact, see Exercise 9 in §4.6. 


2.38. Theorem: V2 is irrational. 

Proof. Assume, to get a contradiction, that v2 is rational. This assumption means 
that for some integers m,n with n Æ 0, V2 = m/n. By the fact mentioned above, we are 
allowed to assume that m and n are not both even. We reach a contradiction by proving 
that, in fact, m and n must both be even. Squaring v2 = m/n and rearranging, we deduce 
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that m? = 2n?, where n? is an integer. This equation means that m? must be even. Recall 
a theorem already proved in the last section: “if z? is even, then g is even.” Using this 
result and the Inference Rule for IF, we see that m is even. So, there is an integer k with 
m = 2k. Now, m? = 2n? becomes (2k)? = 2n?, so 4k? = 2n?, so 2k? = n?. Here, k? is an 
integer, so n? is even. By the same theorem used earlier, we deduce that n is even. We 
now have the contradiction “m and n are not both even, and m is even, and n is even.” 
The only way out of the contradiction is to admit that v2 is not rational. 


So far, we have checked that propositional forms are tautologies by writing out truth 
tables. The next example shows that proof by contradiction can also be used to demonstrate 
that a propositional form is a tautology without making a giant truth table. 


2.39. Example. Use proof by contradiction to show that 
(PVQVR)A(PSSJA(QSS)A(RSS)JSS (2.4) 


is a tautology. 

Proof. |Recall, by definition, that A is a tautology iff every row of the truth table for A 
evaluates to true. Note the universal quantifier in the definition text. So, to begin the proof 
by contradiction, we deny this and say:] Assume, to get a contradiction, that some 
row of the truth table for the propositional form (2.4) evaluates to false. In such a row, let 
us see what we can deduce about the truth values of various subexpressions of (2.4). The 
overall propositional form is a false IF-statement in this row, which (by the truth table for 
IF) forces 

(PVQVR)A(PSS)A(Q=>S)A(R= S) 


to be true and S to be false. Next, the truth of the AND-statement just written forces each 
input to the AND to be true, so we see that 


PVQVR, P=}=S, Q=S5, R=S 


are all true in this row. Now, since P > S is true and S is false, it must be that P is false in 
this row (by the truth table for IF or the Contrapositive Inference Rule for IF). Similarly, 
Q must be false and R must be false in this row. But now P V Q V R must be false in this 
row, and yet also true. This contradiction shows that the original propositional form must 
be a tautology. 


Proving XOR Statements and OR Statements 


We close this section by introducing proof templates for proving XOR-statements and OR- 
statements. The template for XOR arises from the logical equivalence P $ Q = P & (~Q), 
together with the IFF template discussed earlier. 


2.40. Proof Template to Prove PQ. 
Part 1. Prove P > (~Q) (by any method). 
Part 2. Prove (~Q) = P (by any method). 


Similarly, we can create templates for proving P V Q by converting the OR-statement 
to an IF-statement and invoking previous proof templates. Specifically, note that 


PVR = (P)=Q = (Q)>P. 


This leads to the following templates. 
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2.41. First Proof Template to Prove PV Q. 
Assume ~P. Prove Q. 


2.42. Second Proof Template to Prove PV Q. 
Assume ~Q. Prove P. 


2.43. Example. Let x,y be fixed real numbers. Prove: if x # 0 and y £0, then ry 4 0. 
Proof. [The inequalities in the given statement seem hard to work with; we can convert 
these to equalities by doing a contrapositive proof. So we begin:| Assume zy = 0. Prove 
x = 0 or y = 0. Our new goal is an OR-statement, so we use the first template and assume 
x #0. We must prove y = 0. [We must now combine our two assumptions (xy = 0 and 
x #0) to reach the new goal y = 0. We do so as follows:] It is known that the nonzero real 
number x has a multiplicative inverse x7! € R satisfying x~'x = 1. Multiplying both sides 
of ry = 0 by x7! gives x! (xy) = x710. By algebra, this becomes ly = 0, giving y = 0. 


We can prove an OR-statement with more than two alternatives as follows. 


2.44. Proof Template to Prove PVQVRVS. 
Assume ~P and ~Q and ~R. Prove S. 


This proof template can be justified via truth tables, or by the Inference Rule for IF 
and the tautology 


((~P) A (~Q) A (~R)] => S) > (PVQVRVS). 


When using the template, we can assume the negations of any three of the four alternatives 
and then prove the remaining alternative. The pattern seen in this template extends to a 
template for proving A, V Ag V--- V Ak where k > 2. To prove such an OR-statement, 
assume all but one of the A; is false, and then prove the remaining Aj. 


Section Summary 


1. Template for Proof by Contradiction. To prove a statement P by contradiction: 
Assume, to get a contradiction, ~P. 
Combine the assumption with other known facts to deduce a contradiction. 
Say: “This contradiction shows that P is true, after all.” 


2. Advice on Contradiction Proofs. (a) Signal the start and end of a contradiction 
proof with phrases similar to those in the proof template, to lessen the risk of 
confusing the reader (or writer) of the proof. (b) Use the denial rules to find a 
useful form of ~P. (c) The proof template does not give you a specific contradic- 
tion to aim for, which makes this template hard to use compared to others. You 
need creativity and patience to explore the consequences of the given assumptions 
until a contradiction is found. 

3. Proof Template for XOR. To prove P@Q: 

Part 1. Prove P = (~Q) by any method. 
Part 2. Prove (~Q) => P by any method. 
4. Proof Template for OR. To prove PVQV RV S: 
Assume ~P and ~Q and ~R. Prove S. 
(Similarly for OR-statements with any number of alternatives.) 
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Exercises 


1. Use proof templates for IF, OR, etc., to write a proof outline for each statement. 
a) If f is continuous or monotonic, then f is integrable. [use contradiction proof] 
b) If P is a prime ideal and ab € P, then a € P or b € P. [use direct proof] 

c) If P is a prime ideal and ab € P, then a € P or b € P. [use contrapositive 
proof] 

d) Yz € Z, Yy €Z,Vz € Zn E Z, (x > 0 and y > 0 and n > 2) > r” +y” #2”. 
use proof by contradiction] 

e) Yn € Z, Ap € Z, p > nAp is prime ^p + 2 is prime [use proof by contradiction] 


2. Let x be a fixed real number. Prove each statement by contradiction. 

a) If x 40, then —x Æ 0. (b) If x £0, then 1/x 40. 

3. Prove by contradiction: Vz € R,Vy € R, if x € Q and y ¢ Q and zx ¥ 0, then 
zy ¢ Q. 

4. Prove by contradiction: Vm € Z, Yn € Z, 6m + 8n Æ 5. 

5. Prove by contradiction: P > (Q => P) is a tautology. 


6. Suppose, in a certain proof, we assume ~ P, go through a chain of reasoning, 
and eventually deduce P. What, if anything, can we conclude about the truth or 
falsehood of P? 


7. Let x be a fixed real number. (a) Prove: if x? + 4x — 60 = 0, then z = —10 or 
x = 6. (b) Is the converse of part (a) true? 
8. For fixed x € R, prove: if x? > x, then x < 0 or z > 1. 
9. For fixed m € Z, use (2.1) to prove: 2 divides m XOR 2 divides m + 7. 
10. For fixed x,y € Z>o, prove: if x + y = 5, then x = 1 or x = 2 or x = 3 or x = 4. 


11. (a) Find a statement equivalent to (PAQ) = R that does not use A or =>. (b) Give 
four possible proof outlines of (P A Q) = R, each with different assumptions and 
a different goal. 


12. Give four possible proof outlines of A > (BV (~C)). 


13. Suppose this fact is a known theorem: for all a,b € Z, if 3 divides ab, then 3 
divides a or 3 divides b. Use this to prove that V3 is irrational, by imitating a 
proof in this section. 


14. Suppose we imitate the proof of Theorem 2.38 in an attempt to prove that v4 is 
irrational. Find the exact point in the proof that no longer works. 


15. (a) Use the truth table for XOR to justify the XOR proof template. (b) Use the 
truth table for OR to justify the two proof templates for proving P V Q. (c) Use 
any method to justify the proof template for proving PVQV RVS. 


16. Make a new proof template for proving P@Q based on the equivalence (P®Q) = 
(PV Q)A ~(P A Q); prove both parts of the AND-statement by contradiction. 


17. Assume p,a, b,c are fixed integers, and p is known to satisfy: Va, y € Z, if p divides 
zy, then p divides x or p divides y. Prove: if p divides abc, then p divides a or p 
divides b or p divides c. 

18. Fix m € Z. Find the error in the following proof that 6m is odd. Proof. Assume, to 
get a contradiction, that 6m is not odd. So 6m is even, which means Jk € Z, 6m = 
2k. Choose k = 3m, which is in Z by closure. By algebra, 2k = 2(3m) = 6m, 
proving that 6m is even. 
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19. Find the error in the following proof outline of the statement (P > Q) > (~S). 
Proof. To get a contradiction, prove (P > Q) A S. Part 1. Assume P; prove Q. 
Part 2. Prove S. 


20. Prove by contradiction: [P > (Q > R)] > [(P = Q) = (P => RB) is a tautology. 


21. Let m be a fixed integer. Recall the theorem that every integer is even or odd, 
but not both. Using this, prove: 
if m is even, then (Sk € Z, m = 4k) @ (Jj € Z,m = 4j + 2); 
but if m is odd, then (da € Z, m = 4a + 1) $ (3b € Z,m = 4a + 3). 


22. Let b be a fixed odd positive integer. Prove V2 is not rational. 
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2.5 Proofs by Cases and Disproofs 


We ended the last section by describing ways to prove OR-statements that are not yet known 
to be true. Now, we consider the related question of how to use an OR-statement that is 
already known to be true. We introduce a new proof method called proof by cases that is 
often needed in situations where we already know (or have assumed) an OR-statement and 
need to prove some other statement. The second half of the section considers the question 
of how we may disprove statements we believe to be false. 


Proof by Cases 


We introduce proof by cases in the situation where we know (or have assumed) an OR- 
statement with three alternatives. 


2.45. Proof Template for Proof by Cases. Suppose PVQV R is a known OR-statement. 
To prove any statement S' using proof by cases: 

Say “Since P V Q V R is known, we consider 3 cases.” 

Case 1. Assume P. Prove S. 

Case 2. Assume Q. Prove S. 

Case 3. Assume R. Prove S. 

Conclude by saying “So, S is true in all cases.” 


2.46. Remarks on Proof by Cases. 


(a) The proof outlined above has three completely separate subproofs, called the three cases. 
In each subproof, we must prove the target statement S. Each of these three subproofs begins 
with a different temporary assumption (P, Q, or R), so the details of these subproofs are not 
the same. Assumptions and conclusions appearing in the proof of one case cannot be reused 
in another case unless they are reproved under the assumptions for that case. Temporary 
variables used in one case can be reused (with new meanings) in other cases. 


(b) The proof template extends to the situation where we start with a known OR-statement 
Pi V Pa V--- V Pk with k alternatives, where k > 2, and we are trying to prove S. In this 
setting, there are k cases. The proof outline looks like this: 


Since Pi V--- V Pk is known, consider k cases. 
Case 1. Assume P,. Prove S. 


Case i. Assume P;. Prove S. 


Case k. Assume P;. Prove S. 
So, S is true in all cases. 


(c) Here is a technical justification for why the proof-by-cases template is legitimate. On 
one hand, in order to use this template, we must already know (or have proved) PV QV R. 
On the other hand, Case 1 of the template proves the IF-statement P = S; Case 2 proves 
Q = S; and Case 3 proves R > S. By the AND-template, we have proved 


(PVQVR)A(PSS)A(QSS)A(R=S). 
We also have the following axiom (a tautology proved in (2.4) of the last section): 


(PVQVR)A(PSS)A(QSS)A(R=SS) S58. 
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Applying the Inference Rule for IF to the two displayed theorems, we see that S is a new 
theorem, as needed. 


Examples of Proof by Cases 


Let us look at some sample proofs where proof by cases can be used. We remark that you will 
generally not be told: “use proof by cases here.” Writers of proofs must recognize situations 
where proof by cases can be applied. As a general rule, if you encounter a situation where 
you know or have assumed an OR-statement (say because a proof template directed you to 
write one), then proof by cases gives you one potential way to utilize that OR-statement. 


2.47. Example. Let a and b be fixed integers. Prove: if a is even or b is even, then ab 
is even. Proof. Assume a is even or b is even. Prove ab is even. [Noting that we have just 
assumed an OR-statement, we continue by using proof by cases.] 

Case 1. Assume a is even. Prove ab is even. We have assumed there is an integer k with 
a = 2k. We must prove dm € Z,ab = 2m. Multiplying our assumption by b gives ab = 2kb. 
Choosing m to be kb, which is an integer, we get ab = 2m as needed. 

Case 2. Assume b is even. Prove ab is even. We have assumed there is an integer k with 
b = 2k. We must prove dm € Z,ab = 2m. Multiplying our assumption by a gives ab = 2ka. 
Choosing m to be ka, which is an integer, we get ab = 2m as needed. 

Comment: Although the two cases had nearly identical proofs, both were logically necessary. 


2.48. Example. For a fixed integer n, prove: if 30 divides n or 70 divides n, then 10 divides 
n. Proof. Assume 30 divides n or 70 divides n. Prove 10 divides n. [We have assumed an 
OR-statement, so we decide to use proof by cases. 

Case 1. Assume 30 divides n. Prove 10 divides n. We have assumed n = 30k for some 
integer k. We must prove n = 10m for some integer m. Note n = 30k = 10(3k). So we 
choose m = 3k, which is an integer such that n = 10m. 

Case 2. Assume 70 divides n. Prove 10 divides n. We have assumed n = 70k for some 
integer k. We must prove n = 10m for some integer m. Note n = 70k = 10(7k). So we 
choose m = 7k, which is an integer such that n = 10m. 


2.49. Example. Let n be a fixed integer. Prove: n? — n is even. 

Proof. We must prove Jk € Z,n?—n = 2k. [Where do we go from here? Thinking of the next 
sentence in this proof requires a leap of creativity, even though the sentence itself seems to 
be stating a rather mundane fact.] We know n is even or n is not even (this is an axiom). 
Since we have stated a known OR-statement, we can try proof by cases. 

Case 1. Assume n is even. Prove Jk € Z,n? — n = 2k. We have assumed n = 2m for some 
integer m. We deduce from this that n? — n = (2m)? — 2m = 4m? — 2m = 2(2m? — m). We 
now see that we can choose k = 2m? — m (which is an integer since m is) in order to have 
n? —n=2k. 

Case 2. Assume n is not even, so n is odd by a known theorem. Prove Jk € Z,n? — n = 2k. 
We have assumed n = 2m + 1 for some integer m. Now, algebra gives 


n? —n = (2m +1)? — (2m +1) = 4m? + 4m +1 — 2m — 1 = 4m? + 2m = 2(2m? + m). 


Choose k = 2m? + m; this is an integer such that n? — n = 2k. So, n? — n is even in all 
cases. 


2.50. Example. Let x be a fixed real number. Prove: x? = 42 iff x = —2 or z = 0 or z = 2. 
Proof. This IFF-statement requires a two-part proof. 
Part 1. Assume x? = 4x. Prove x = —2 or x = 0 or x = 2. Weare proving an OR-statement 


now, so we follow the OR-template. Assume « 4 —2 and x Æ 0. Prove x = 2. Since x Æ 0, 
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we can divide both sides of the assumption z? = 4z by x to deduce x? = 4, so x? —4 = 0, so 
(a — 2)(a + 2) = 0. Since z # —2, we can divide both sides of this equation by the nonzero 
quantity x + 2 to get x — 2 = 0, hence x = 2, as needed. 


Part 2. Assume z = —2 or x = 0 or x = 2. Prove z? = 4a. Here we have assumed an 
OR-statement, so we use proof by cases. 
Case 1. Assume x = —2. Prove x? = 4x. By arithmetic, x? = —8 = 4g in this case. 


Case 2. Assume x = 0. Prove z? = 4x. By arithmetic, 2? = 0 = 4z in this case. 
Case 3. Assume x = 2. Prove x? = 42. By arithmetic, x? = 8 = 42 in this case. 
Since x? = 4z in all cases, we are done with part 2. 


Definition by Cases 


Sometimes cases are used to define a new mathematical concept. Proofs involving such 
concepts often rely on proof by cases. We give a few examples here involving the sign and 
absolute value of real numbers. 


2.51. Definition: sgn(z). For each real number x, define sgn(z) = +1 if x > 0, sgn(x) = 0 
if x = 0, and sgn(x) = —1 if x <0. 


For this definition by cases to make sense, we need to know that for each real x, exactly 
one of the statements “x > 0,” “x = 0,” or “x < 0” is true. We take this to be a known 
fact from algebra. When we prove theorems about sgn(x), we often start the proof with the 
known fact “x > 0 or x = 0 or x < 0,” which leads into a proof by cases. 


2.52. Example. For fixed z € R, prove sgn(—z) = — sgn(z). 

Proof. We know z > 0 or z = 0 or z < 0, so use cases. 

Case 1. Assume z > 0; prove sgn(—z) = —sgn(z). Because z > 0, we know —z < 0 by 
algebra. By definition of sgn, we know sgn(—z) = —1 and sgn(z) = +1, so sgn(—z) = —1 = 
—sgn(z) holds in this case. 


Case 2. Assume z = 0; prove sgn(—z) = —sgn(z). Here —z = —0 = 0, so sgn(—z) = 
sgn(0) = 0 = —0 = —sgn(0) = —sgn(z) follows. 
Case 3. Assume z < 0; prove sgn(—z) = —sgn(z). Because z < 0, we know —z > 0 by 


algebra. Therefore, sgn(—z) = +1 = —(—1) = —sgn(z) in this case. 
We see that sgn(—z) = —sgn(z) holds in all cases. 


2.53. Definition: Absolute Value. For each real number r, define |r| = r if r > 0, and 
define |r| = —r ifr <0. 


For example, |0.23| = 0.23, whereas |—3] = —(—3) = 3. Note carefully that when r is 
negative, the expression —r denotes a positive real number. 


2.54. Example: For fixed x, y € R, prove |xy| = |æ] - |y|. 

Proof. [Here we need cases for both x and y. It turns out to be easier to create special cases 
when one variable is zero.}] We know z = 0 ory=Ooraz,y>O0orz>O0>yory>0>a 
or x,y < 0, so use cases. 

Case 1. Assume x = 0; prove |zy| = |z|- |y|. In this case, we know ry = Oy = 0, so 
|zy| = |0| = 0 and |z]: |y| = Oly| = 0. 

Case 2. Assume y = 0; prove |xy| = |a|-|y|. As in Case 1, ry = 0, so |xy| = 0 = |z|0 = |2|-|y]. 
Case 3. Assume x > 0 and y > 0; prove |ay| = |x|- |y|. By algebra, zy > 0. Using the 
definition of absolute value three times, we see that |xy| = ry = |x| - |y]. 


Case 4. Assume x > 0 and y < 0, so ry < 0. We compute |xy| = — (xy) = x: (—y) = |z|- Iyl. 
Case 5. Assume x < 0 and y > 0, so ry < 0. We compute |xy| = — (xy) = (=x) -y = |z|- ly]. 
Case 6. Assume x < 0 and y < 0, so ry > 0. We compute |xy| = xy = (—2)- (—y) = |z|- ly]. 


So |zy| = |x| - |y| holds in all cases, as needed. 
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Disproving Statements 


So far, we have given many proof templates that enable us to prove true statements P. 
Of course, many statements P are false, and we cannot prove such statements. Instead, we 
would like a method for disproving a statement we believe to be false. Since P is false iff 
~P is true, we are led to the following template. 


2.55. Proof Template to Disprove a False Statement P. 
Prove ~P. 


The next question is, how do we prove a true statement of the form ~P? We offer the 
following template as an answer. 


2.56. Proof Template to Prove ~P. 
Use the denial rules to replace ~P with an equivalent statement Q that does not begin with 
NOT. Prove Q. 


To complete Step 2, we recursively apply other templates, expand definitions appearing 
in Q, and so on. 


2.57. Example. Disprove: Vz € R, x? > x. 

Solution. By the disproof template, we must prove ~Vx € R, x? > x. Using the denial rules, 
we find that we must prove the statement: Ja € R, x? < x. [If you cannot initially find an 
example of this existential assertion, try drawing the graphs of y = x? and y = x.] We now 
see that we may choose x = 1/2, which is a real number such that z? = 1/4 < 1/2 = zx. 


2.58. Example. Disprove: for all a,b € Z, if a divides b, then a < b. 

Solution. Using the denial rules, we must prove there exist integers a,b such that a divides 
b and a > b. [Before reading further, can you think of an example of this situation? The 
key is to realize that a and b come from the universe Z, which includes negative integers. ] 
We can now choose a = 4 and b = —4 (say). Note that —4 = 4(—1) where —1 is an integer, 
verifying that 4 divides —4. We also know 4 > —4, finishing the proof. 


In the next example, the notation “x ¢ Q” means “~(x € Q),” i.e., x is not rational. 


2.59. Example. Disprove: for all real x,y, if x ¢ Q andy ¢ Q, then z + y ¢ Q. 

Solution. We must prove there exist x,y € R with x ¢ Q and y ¢ Q and z +y E Q. 
Recall from Theorem 2.38 that v2 is not rational. So, let us choose x = V2 and y= —V/2. 
The fact that y is also irrational follows from Example 2.31, taking r = v2. We know 
x+y = V2 + -v2 = 0 = 0/1, where 0 and 1 are integers and 1 Æ 0. So z +y € Q, as 
needed. 


The last example shows that the set of irrational real numbers is not closed under 
addition. Similarly, we ask the reader to check below that the set of irrational numbers is 
not closed under multiplication. 


2.60. Remark. Beginners sometimes confuse disproofs and proofs by contradiction. Let us 
contrast these two situations. To prove a true statement P by the contradiction method, we 
begin with the line “Assume ~P to get a contradiction.” Our goal in the proof is to deduce 
a contradiction (not specified in advance). In contrast, to disprove a false statement P 
by the disproof method, we begin with the line “Prove ~P.” The next step in both proofs 
is to find a useful denial of ~P by the denial rules. But the two proofs use this denial in 
totally different ways, since ~P is the assumption in the contradiction proof and the goal 
in the disproof. To avoid confusion, always write (and pay attention to!) these logical status 
words. 
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Section Summary 


1. Proof by Cases. When you know or have assumed an OR-statement PV QV R 

and need to prove another statement S, say this: 

“Since P V Q V R is known/assumed, we consider 3 cases. 

Case 1. Assume P. Prove S. 

Case 2. Assume Q. Prove S. 

Case 3. Assume R. Prove S. 

So, S is true in all cases.” 

(Similarly, a known OR-statement P; V P2 V- - -V P, leads to a proof with n cases.) 

2. When to Use Proof by Cases. If some proof template directs you to assume an 
OR-statement, you often need proof by cases to use this assumption. To prove 
properties of concepts defined via cases, you typically must use proof by cases 
based on the same cases appearing in the definition. Axioms that are instances 
of the tautology P V (~P) are known OR-statements that often lead to proof 
by cases. For example, given a fixed integer no, we know “ng is even or no is not 
even.” Given a fixed real number rg, we know “ro > 0 or rọ < 0.” Given a fixed 
object xp and universe U, we know “zo € U or zo ¢ U.” 

3. Disproving False Statements. To disprove a false statement P, prove the nega- 
tion ~P. To prove a statement ~P that begins with NOT, use the denial rules 
to replace this statement with an equivalent statement not starting with NOT. 
Remember to flip any quantifiers that occur at the beginning of the statement 
being denied. 

Exercises 

1. Give proof outlines of the following statements. 

(a) If f is continuous or monotonic, then f is integrable. [use direct proof] 
(b) It is false that every injective function has a left inverse. 
(c) (PV Q) ® (RA (~S)). 

2. Consider the statement (A V B V (~C)) > (DA E). Give proof outlines of this 
statement based on: (a) a direct proof and proof by cases; (b) a contrapositive 
proof and proof by cases; (c) a contradiction proof. 

Let n be a fixed integer. Prove: if 6 divides n or n + 5 is odd, then n is even. 
Let k be a fixed integer. Prove: k? + 3k is even. 

(a) Let x and y be fixed integers. Prove: if x is even or y is even, then x7y° is 
even. (b) Disprove: Vx € Z,Vy € Z, if x is odd or y is odd, then xy? is odd. 

6. Let x be a fixed real number. 

(a) Prove: if x < 1 or x > 3, then z? — 4x +3 > 0. Must the converse hold? 
(b) Prove: zì — x > 0 if —1<x<0orzr>1. 

7. Let z € R be fixed. Prove: for z? — 7z + 10 to be positive, it is necessary and 
sufficient that z < 2 or z > 5. 

8. Let x be a fixed real number. 

(a) Prove: if x? is rational or zì is rational, then zê is rational. 
(b) Prove: if xë ¢ Q, then x? ¢ Q or z? ¢ Q. 
9. (a) Disprove: for all m,n € Z, if 10 divides mn, then 10 divides m or 10 divides 


n. (b) Disprove: for all a,b € Zo, if a divides b, then a < b. 
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10. 


11. 
12. 
13. 
14. 


15. 
16. 


17. 
18. 


19. 


20. 
21. 
22. 


23. 


24. 
25. 
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(a) Disprove: for all x,y € R, if x Q and y ¢ Q, then zy Z Q. 
(b) Disprove: for all x,y € Ryo, if x Z Q and y Z Q, then z +y ¢ Q. 


For fixed x,y € R, prove: sgn(zy) = sgn(x) sgn(y). 


For fixed x € R, is sgn(x?) = sgn(x) true or false? Explain. 
For fixed x € R, prove: (a) |x| > 0; (b) |—2| = |x|; (c) « = sgn(x)|x]. 


(a) Disprove: for all x,y € R, |x + y| = |z| + |yl. 
(b) Disprove: Vr € R,sgn(x?) = +1. 
(c) Disprove: for all x,y € R, sgn(x + y) = sgn(sgn(x) + sgn(y)). 


For fixed x,y € R, prove: |x + y| < |x| + |y|. 


Use the previous exercise (which holds for all x,y € R) to deduce the following 
facts for fixed real numbers r, s, t. 

(a) |r +s+t| < |r] + [s| + tl. 

(b) |r —t| < |r — s| + |s — tl. 

(c) |r = s| < [r] + |s]. 

For fixed x,y € R, prove: ||| — |y|| < |x — yl. 


Let x,a,r be fixed real numbers. (a) Prove: |x — a| < r iffa-r<a<a+tr. 
(b) Use (a) and logic to deduce a condition equivalent to |x — a| > r. 


Find the error in this attempted disproof of “for some positive integers x,y,z, 
x+y? = 23.” Disproof: We must prove that for some positive integers 2, y, z, 
xr? +y? Æ z’. Choose x = y = 1 and z = 2. We compute z? +y? =14+1= 2, 
while 23 = 23 = 8. 

Find and correct the error in this outline of a disproof of “P > (Q => R)?” 
Disproof: Assume P and Q are true and R is false. Deduce a contradiction. 


Define f(x) = x? +1 for 0 < x < 2, and f(z) = 3 -— z for 2 < x < 3. Fora fixed 
x € R, prove: if 0 < x < 3, then 0 < f(x) <5. 


For a fixed integer m, prove m? + 5m — 3 is odd. 


Suppose we know this theorem: Vm € Z, Jj € Z,m = 4j Vm = 4j +1Vm= 
4j +2 V m = 4j + 3. Let b be a fixed integer. Prove: 4 divides b? or 8 divides 
b — 1. 


For fixed n € Z, prove: if n is odd, then 8 divides n? — n. 


Suppose “P V Q V R” is a known theorem. Justify the following proof template, 
which proves a statement S by combining the contradiction method with proof 
by cases. 

Proof of S. We know PV Q V R, so use cases. 

Case 1. Assume P and ~S. Deduce a contradiction C4. 

Case 2. Assume Q and ~S. Deduce a contradiction C2. 

Case 3. Assume R and ~S. Deduce a contradiction C3. 

Since all three cases led to a contradiction, S must be true. 
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2.6 Proving Quantified Statements 


So far, we have discussed proof templates involving the following logical operators: 4, >, A, 
=, 6, V, and ~. It is finally time to present templates for proving a universally quantified 
statement. This leads us to a tricky but essential topic: how to manipulate statements 
containing multiple quantifiers. 


Proof by Exhaustion 


Our first proof template for proving a universal statement “Vx € U, P(x)” is called proof by 
exhaustion. It can only be used when the universe U is a finite set. 


2.61. Proof Template for Proof by Exhaustion. Suppose U = {21, 22,...,2n} is a 
finite set. To prove Vx € U, P(x) in this situation: 

Step 1. Prove P(z1). 

Step 2. Prove P(z2). 


Step n. Prove P(zn). 


Intuitively, we can verify the universal statement by explicitly testing the truth of P(c) 
for every possible object c in the finite universe U. We can justify this proof method more 
formally by recalling the following rule from §1.4: 


Va E {21, 22,-.-,2n},P(x)| = Pla) A Plza) A+++ A Plén) |. 


To prove the universal statement, we may instead prove the equivalent AND-statement. 
Following the proof template for an AND-statement, we obtain the proof outline written 
above. The name “exhaustion” is used because we are exhaustively checking all possible 
cases, one at a time. 


2.62. Example. Every time we use a truth table to verify that a propositional form is a 
tautology, we are really doing a proof by exhaustion. To see why, recall that a propositional 
form T is a tautology iff every row in the truth table for 7 evaluates to true. This definition 
has the form Va € U, P(x), where U is the set of rows in the truth table, and P(x) is the 
statement “the propositional form 7 has output TRUE in row x of the truth table.” Since 
there are only finitely many rows in the truth table, U is a finite set. By filling in the truth 
table, we are checking the truth of P(x) for each row x, one at a time. 


Generic Element Proofs of Universal Statements 


When the universe U is not finite, proof by exhaustion does not work. We need a new 
strategy for proving a universal statement “Vx € U, P(x),” which says that P(x) is true for 
all objects xo in the potentially infinite set U. This strategy is called the generic-element 
method. 


2.63. Generic-Element Proof Template to Prove Vz € U, P(x). 
Let xo be a fixed, but arbitrary, object in U. 
Prove P(zxo). 


Before using this template, let us discuss its logical nuances by contrasting it with the 
template for proving Jx € U, P(x). When we prove the existence statement Jx € U, P(x), 


Proofs 83 


we do so by choosing one specific, constant object zo in U that makes P(x) true. For 
instance, we might say: “choose zo = 7.” More generally, zo could be an expression involving 
other constants introduced earlier in the proof. We complete the existence proof by showing 
that xo really is in U and proving that P(xo) really is true. 

In the new template to prove Vx € U, P(x), we also introduce a constant object zo in U, 
but we do not choose a specific value for this object. The letter xp used here represents 
a “generic” object in U that we do not control. We know nothing about xo initially, except 
that it comes from the universe U. If we can prove the proposition P(zo) using only this 
information, we may safely conclude that P(x) does hold for every object x in U. Here is a 
basic example. 


2.64. Example. Prove: Vx € Z, x divides 32?. 

Proof. Let xo be a fixed, but arbitrary integer. We must prove xo divides 322. Expanding 
the definition, we must prove Jk € Z,3x22 = kao. Choose ko = 320, which is an integer 
since zo € Z and 3 € Z and Z is closed under multiplication. We know kozo = (3%9)a9 = 
3x2, so the proof is done. 


In this proof, we used subscripts (zo and ko) to distinguish the constants xo and ko from 
the quantified variables x and k. Observe that xo (coming from the universally quantified 
variable x) is a generic constant, whose value is not chosen or controlled by us. But ko 
(coming from the existentially quantified variable k in the definition of “divides” ) is a specific 
constant that we choose in a way to make the proof work. Observe that kg can depend on 
the previously introduced constant £o; the expression 329 used for kg is a specific choice 
depending on the previous generic constant x. 

We remark that many of the examples in previous sections were really proving universally 
quantified statements. Since we had not yet introduced the generic-element template, we 
introduced those examples with statements like “Let n be a fixed integer.” So, for instance, 
Example 2.20 on page 59 actually proves the theorem 


Va € Z,Vb € Z,Vc € Z, if a divides b and b divides c then a divides c. 


Statements with Multiple Quantifiers 


We now discuss statements with multiple quantifiers, which reveal further distinctions be- 
tween the proof templates for universal and existential quantifiers. The first point is that we 
may safely reorder two universal quantifiers or two existential quantifiers, as stated below. 


2.65. Quantifier Reordering Rules. For any open sentence P(x, y): 


Va,Vy,P(a,y)| is equivalent to | Vy,Va, P(x,y) | 


da,dy,P(x,y)| is equivalent to | dy, da, P(x, y) |. 


Intuitively, the statements Vx, Vy, P(x, y) and Vy, Vx, P(x, y) are equivalent since both state- 
ments assert that P(20, yo) is true for all possible choices of the objects zo and yo. Similarly, 
da, dy, P(x,y) and dy, Jx, P(x,y) are equivalent since both statements assert the existence 
of two objects xo and yo (possibly equal) such that P(x, yo) is true. We can give more for- 
mal proofs of these rules using the proof templates we have developed; see §2.7 for details. 
We can also use restricted quantifiers here; for instance, 


Jx € U, Jy € V,P(x,y)| is equivalent to |3y € V, ax € U, P(x,y) |. 
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Recall from our discussion of definitions that we can change the letter used for a quan- 
tified variable, as long as the new letter does not already have a different meaning. For 
instance, the three quantified statements Vz, P(z) and Vw, P(w) and Vs, P(s) are equiva- 
lent, since each statement says that P(c) is true for every specific object c. We can do the 
same thing for statements with multiple quantifiers. For example, 


Va, dy,Vz,P(2,y,z)| is equivalent to | Vr, ds, Vt, P(r, s,t) |. 


Mixed Quantifiers 


A crucial fact of quantified logic is that the relative order of universal and existential quan- 
tifiers in a multiply-quantified statement has a dramatic effect on the meaning of the state- 
ment. The next examples illustrate this fact. 


2.66. Example. Prove or disprove: Vx € Z, Jy € Z,y > x. 

Solution. This statement is true; it can be translated into English by saying: “for every 
integer, there is a larger integer.” To prove it with the templates, we say: let xg be a fixed, 
but arbitrary integer. We must prove dy € Z, y > xo. Choose yo = zo +1. We know yo 
is an integer (by closure), and x) +1 > xo, completing the proof. In this proof, note carefully 
that the specific constant yo chosen in the existence proof was allowed to depend on the 
previously introduced generic constant xo. Contrast this situation to the next example. 


2.67. Example. Prove or disprove: Jy € Z,Vx € Z,y > x. (The only difference from the 
previous example is the order of the initial quantifiers.) 

Solution. If we tentatively try to prove this statement by using the templates, we must begin 
by choosing a specific, constant integer yo, and continue by proving Vz € Z, yo > x. What 
yo Shall we pick? If we try yo = 7, we must prove the statement Vx € Z,7 > x, which is 
evidently false. If we try yo = 100, we must prove the statement Vx € Z,100 > x, which is 
also false. We might try yo = œœ, but oo is not an integer. Why can’t we choose yo = £o +1, 
as we did in the previous proof? The reason is that in this proof, the fixed constant integer 
xp has not been introduced yet, so we cannot use it when choosing yo. That is the key 
difference between the two statements. In fact, the current statement is false; in English, it 
says that “there is an integer larger than every integer.” 

Now, since we think the statement is false, we ought to be able to disprove it. We do 
so by proving a useful denial of the statement. Applying the denial rules, we must prove 
Vy € Z,Jx € Z,y < x. This denial resembles the original example, since it starts with V 
followed by 3. To prove the denial, we fix an arbitrary integer yo, and prove Jx € Z, yo < 2. 
We now choose xo to be yo, which works since yo € Z and yo < yo. We could have also 
chosen zo to be yo + 1 or yo + 2, for instance. Note that zo is allowed to depend on the 
previously introduced constant yo. 


In general, the statement Jy,Vxz,P(z,y) is harder to prove than the statement 
Va, dy, P(x,y), because the y we choose is allowed to depend on x in the latter state- 
ment, but y cannot depend on x in the former statement. Nevertheless, statements of the 
first type are sometimes true, as we see next. 


2.68. Example. Prove or disprove: dy € Z, Vx € Z,y < 27. 


Proof. The existential quantifier comes first, so we must begin by choosing y. We choose 
yo = —1, which is a specific integer. We must prove Vz € Z,—1 < x”. Let xo be a fixed, 
but arbitrary integer. We must prove —1 < 22. By algebra, 72 > 0 > —1. [We could give a 
more detailed proof that x2 > 0 by looking at the two cases where xp > 0 or xo < 0, and 
using properties of inequalities to deduce x > 0 in each case.| 
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2.69. Example. Prove or disprove: Vz € Z, Jy € Z,y < 27. 


Proof. [This statement is true; we proved an even stronger statement in the previous ex- 
ample. To see why, let us prove the current statement.] Let xo be a fixed, but arbitrary 
integer. We must prove Jy € Z,y < x2. Choose yo to be —1 [note that yo is allowed to 
depend on xo, but our choice is not required to use xo]. We know —1 € Z and —1 < 0 < a, 
as before. H 


The last two examples illustrate the following general fact. 


2.70. Theorem. For any open sentence P(x,y) and any universes U and V, 


Jy € U,Yx € V, P(x,y) = Vee V, Jy € U,P(z,y). 


We prove this theorem in the next section. 


Proving Statements with Multiple Quantifiers 


It is very common in mathematics to have statements in which three or more quantifiers 
(some existential, some universal) appear at the beginning of a complex statement. The 
next example illustrates how the rules above generalize to such a situation. The main fact 
to remember is that when proving an existential statement by choosing a value for the 
variable, that value is only allowed to depend on all previously introduced constants, which 
correspond to quantified variables appearing to the left of the given variable. 


2.71. Example. Outline the proof of this statement: 


Vu, V6, dw, Yx, dy, Yz, P(u, 6, w, x, y, z). 


The second quantified variable is the Greek letter ô (delta). 
Solution. We follow the proof templates, processing the quantifiers from left to right, ob- 
taining the following lines in this order: 


Let uo be a fixed, but arbitrary object. 

Let ôo be a fixed, but arbitrary object. 

Choose wo = (...) [some expression that may involve wo and do]. 

Let xo be a fixed, but arbitrary object. 

Choose yo = (...) [some expression that may involve uo, ĝo, wo and xo]. 
Let zo be a fixed, but arbitrary object. 

Prove P(ug, 60, Wo; £o, Yo; Z0)- 


Suppose someone tries to shorten the preceding text by writing the following variation 
of the proof outline: 


Let uo, 60, £o, and zo be fixed, but arbitrary objects. 
Choose wo = (...) and yo = (...). 
Prove P(uo, ôo, Wo, £0, Yo; zo). 


This variation is not correct, since the order in which variables are introduced indicates 
that wo and yo might potentially depend on all the previously mentioned constants uo, 60, 
Zo, and zo. In fact, the alternate proof outline would prove the statement 


Vu, Vô, Va,Vz, IW, J39, Plu, ô, w, x, Y, z), 


which is weaker than the original statement in general. 
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2.72. Example. Prove: Vz € R,Vy € R, if x < y, then dz E R, £ < 2 < y. 

Proof. Let x9 and yo be fixed, but arbitrary real numbers. Assume xg < Yo. Prove 3z € 
R, zo < z < yo. Intuitively, the average of xp and yo should lie between these two numbers. 
So, we choose zo = (xo + yo)/2, which is a real number since R is closed under addition and 
division by 2. We must prove £o < zo < yo, which is really the AND-statement £o < zp and 
zo < yo. To do this, divide the assumed inequality zo < yo by 2 to get 2/2 < yo/2. On 
one hand, adding xo/2 to both sides produces x9 /2 + 19/2 < 20/2 + yo/2, so zo < zo. On 
the other hand, adding yg/2 to both sides produces 29/2 + yo/2 < yo/2+ yo/2, 80 zo < Yo. 
This completes the proof. 


Section Summary 


1. Proof by Exhaustion. To prove Va € U, P(x), where U is the finite set {21,..., Zn}: 
Prove P(z,). Prove P(z2).... Prove P(z,). 


2. Generic-Element Template for Universal Statements. To prove Vx € U, P(x): 
Step 1. Let xo be a fixed, but arbitrary, object in U. 
Step 2. Prove P(zo). 
Here xo is a new letter denoting a generic constant. We cannot choose a specific 
value for zg. We have no specific initial knowledge about xo, except we know 
zo € U. Step 1 is often abbreviated by saying: “fix a € U.” 


3. Constructive Proof of 3x € U, P(x). 
Choose a specific object £o; prove zo is in U; prove P(x). 
In contrast to the generic-element method, we must choose a specific object £o 
here. This object is allowed to depend on constants previously introduced in the 
proof, but it cannot involve constants introduced later. 


4. Quantifier Reordering Rules. The following rules hold for all open sentences 
R(x, y): 
1. Va,Vy, R(x, y) s Vy, Yx, R(x, y). 
2. Ax, Jy, R(x, y) <> Jy, da, R(x, y). 
3. Jx, Vy, R(x,y) > Vy, da, R(x, y). 


The converse of rule 3 is not universally valid: in general, changing the order of 
the quantifiers 4 and V affects the meaning of the statement. 


5. Sample Proof Outlines of Statements with Mixed Quantifiers. Observe how differ- 
ences in the types and ordering of quantifiers leads to contrasting proof structures 
in the following examples. 


a) To prove Vz, Vy, 3z, P(x, y, z): Fix zo. Fix yo. Choose zp = (...) (an expression 
that can involve xo and yo). Prove P(20, yo, 20). 


b) To prove Va, 4z, Vy, P(x, y, z): Fix zo. Choose zp = (...) (an expression that 
can involve zo only). Fix yo. Prove P(xo, yo, Zo). 


c) To prove 4z, Yx, Vy, P(x, y, z): Choose zo = (...) (a specific object not depend- 
ing on x or y). Fix zo. Fix yo. Prove P(0, yo, 20). 


d) To disprove Vz, 4z,Vy, P(x,y, z): Prove 3x, Yz, Jy, ~P(z, y, z). Choose £to = 
...) (a specific object not depending on anything else). Fix zg. Choose yo = (...) 
an expression that can involve zo and 29). Prove ~P(20, yo, zo). 
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Exercises 


1. Prove each universal statement. 
(a) For all x,y € Z, if x divides y, then x? divides y?. 
(b) For all x € {1,3,5,7}, 8 divides x? — 1. 
(c) For all xz € R, if x’ ¢ Q, then z ¢ Q. 


2. Give proof outlines for each statement similar to those in summary item 5 on 


page 86. 
(a) Yr, Ys, St, Su, Q(r, s,t, u). 
b) ‚3t, Vr, du, Q(r, S, t, u). 


(b) Ys 

(c) Vs, St, du, Vr, Q(r, s, t, u). 

(d) Ju, St, Vr, Vs, Q(r, s, t, u). 

(e) Find all pairs of statements in (a) through (d) such that the first statement 
always implies the second statement. 


3. Outline disproofs of statements (a) through (d) in the previous exercise. 


4. True or false? Prove each true statement, and disprove each false statement. 
a) Jy € Z, Yz E Z, xy = 0. 

Va € R, Jy € R, zy = 1. 

Jy € R, Vx € R, zy = 1. 

Va € Rso, dy € Rso,y < T. 

Jy € Rso, Vx € Rso,y < T. 

f) Vx € Z, dy € Z>0, 2? = y?. 

5. Prove or disprove each statement. 
a) Im € Z, Yn € Z, mn is even. 
b) dm € Z, Yn € Z, mn is odd. 

c) Yn € Z, Im € Z, mn is even. 
d) Yn € Z, Jm € Z, mn is odd. 

e) Yn € Z, Im € Z, m +n is odd. 


6. Prove or disprove each statement. 

a) Ja € Z, Vb € Z, a divides b. 

b) Ja € Z,Vb € Z, b divides a. 

c) Va € Z, Vb € Z, Ac € Z, a divides b + c. 
d) Va € Z, Jc € Z, Vb € Z, a divides b + c. 
e) Va € Z, Vb € Z, Ac € Z, b + c divides a. 
f) Va € Z, 3c € Z, Vb € Z, b + c divides a. 


7. Prove or disprove each statement. 

a) Jx € R, Vy E€ R, zy is rational. 

b) Va € R, Jy € R, x + y is rational. 

c) Yq € Q, Jn € Z>o,nq E Z. 

d) In € Z>0, Yq E Q, nq E Z. 

e) Jq E€ Q, Yn E€ Z, nq E Z. 

8. Prove or disprove each statement. 

a) Yz € R, Jy E€ R, y ALA |z| = |yl. 

dz € R, vw € R, |z| < [wl]. 

c) Yx € Ryo, 3y E€ R, |x + y| 4 |æ| + lyl. 
Jy € R, Yz € Ryo, |æ + y| £ |e| + lyl. 


b) 
c) 
d) 
e) 


u 


Lu 


Prove: Vy € R, da € R,y = 2a — 7. 
Disprove: dz € R,Vy € R, y = 2a — 7. 
c) Disprove: Vy € Z, dx € Z, y = 2x — 7. 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


An Introduction to Mathematical Proofs 


a) Prove: Vy € R43, dr E€ Rys, £y — 5y — 34 + 13 = 0. 

b) Prove: Vx € Rys5, dy E Rz3, xy — 5y — 3x + 13 = 0. 

c) Prove: dz € R, 3b € R,Vy E R, zy — 5y — 3x + 13 = b. 

Prove or disprove the following statements, which are variations of Example 2.72. 
a) Va € R, 3z CR Vy ECR ar<ysu<z<y. 

b) Va ER, Vz E R, Jy ER, <y>r<z<y. 

c) Yz € R,Yz € R, Aye Ra<zS>u<2<y. 

d) 3z € R, Yx € Reo, Vy E R>o,£ < Z < y. 

e) Yx € Q, Yy E€ Q, 3z EQ, zr <z Sy Vy[<z<r. 


Prove or disprove each statement. 


a) Vx € R, Vy ,3zER,z+y+z=0. 
b) Vz € R, Jy € R,Vz ER,z +y +z =0. 
c) Ix € R, Yy E€ R,Yz E R, z +y +2 =0. 
d) Sy € R, Yz € R, Av € R, £ +yz = 3. 
e) Ix € R, Jy € R, Yz E R, z +yz = 3. 


When choosing objects in this problem, indicate explicitly which other objects 
the chosen object may depend upon. 

(a) Outline a proof of this statement: 

Ve € Rso, Vz € R, 3ô € R>0, Yy € R, |z — y| < ô => | f(x) — fl(y)| < €. 

(b) Outline a proof of this statement: 

Ve € Rso, dd € R>0, Yz € R, Yy € R, |z — y| < ô => | f(x) — f(y)| < €. 

(c) Outline a disproof of the statement in (b). 

[Remark: (a) is the definition of continuity of the function f, whereas (b) is the 
definition of uniform continuity of the function f. These concepts are studied in 
advanced calculus.] 


Let (a1, a2,...,@m,..-) be a fixed sequence of real numbers. Let L € R be fixed. 
a) Outline a proof of: Ve € R>0, IN E Zso, Vm E Zso,(m > N > jam- L| < €). 
b) Outline a disproof of the statement in (a). 


) 
a) Outline a proof of: de € G, Vx € G, (exx = x ^ Jy E€ G,yxz =e). 
b) Outline a disproof of the statement in (a). 


a) Prove or disprove the statement in part (a) of the previous problem, taking 
G = Z, and replacing * by + (integer addition). 

b) Repeat (a) taking G = R and replacing x by - (real multiplication). 

c) Repeat (a) taking G = Qso and replacing * by - (multiplication of rational 
numbers). 


a) Is da, dy, P(x,y) equivalent to dx, dy, P(y, x) for all P(x,y)? Explain. 
b) Is Va, Vy, P(x,y) equivalent to Vz, Vy, P(y, x) for all P(x,y)? Explain. 
c) Is Va, dy, P(x,y) equivalent to Vx, 3y, P(y, x) for all P(x,y)? Explain. 


Create your own proof template to prove a uniqueness statement of the form 
d!a € U, P(x). [Hint: Use the elimination rule for the uniqueness symbol, and 
then combine several previously discussed proof templates to outline a proof of 
the new statement.| 


Use the template in the previous problem to prove each uniqueness statement. 
(a) dla € R,3x +5 = 0. 

(b) Vy € R, Va € Ryo, Vb € R,3!x € R,axz +b = y. 

(c) d!a € Z>o, z? — 5x — 14 = 0. 


Let P(x,y) be a fixed open sentence. 
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26. 


27. 
28. 


29. 
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(a) Outline a proof of: Vz € R,3!y € R, P(x,y). 

(b) Outline a disproof of the statement in (a). 

For each choice of P(x,y), prove or disprove the statement in part (a) of the 
previous problem. (a) P(x,y) is “y = x.” (b) P(x,y) is “x = y?.” (c) P(x,y) is 
“x= y8.” (d) P(x,y) is “ry = 1.” (e) P(x,y) is “t? +y? =4Ay> 0.” 

Let S be a fixed set of real numbers. 

(a) Outline a proof of: dy € R, (Va € S,x < y)AWz E€ R, z < y > WE S, z < u)]. 
(b) Outline a disproof of the statement in (a). 


For each choice of the set S, prove or disprove the statement in part (a) of the 
previous problem. (a) S = R. (b) S is the set of x € R with 0 < x < 1. (c) S is 
the set of x € R with x < 3. (d) S is the set of real numbers of the form 2 — 1/n 
for some n € Zo. (e) S is the empty set. (f) S = Z. 

Prove Ve € Ryo, 46 € Rso, Vz € R, Vy € R, |z — y| < 6 => |(82+5) — (3y + 5)| < € 
by completing the outline in part (b) of Exercise 13. 

Prove Ve € Ryo, Vz € R,Jô € Ryo, Vy € R, |x — y| < 6 = |r? — y?| < € by 
completing the outline in part (a) of Exercise 13. 

Disprove Ve € Ryo, 4d € Ryo, Vx € R, Vy € R, |x — y| < 6 = |x? — y?| < € by 
completing the outline in part (c) of Exercise 13. 


Prove: Ve € R>0, Vx € R>0, 5d € Rso, Vy € R>0,|£—y| < 6 > |(1/x)—-(1/y)| < €. 


Disprove: 


Ve € R>0, 38 € R>0, Va € R>0, Vy € Rso, |x — y| < 6 = |(1/x) — (1/y)| < €. 


(a) Prove: 3!y € R,J!x € R, x? = y. 
(b) Prove or disprove: 3!x € R,3!y € R, z? = y. 
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2.7 More Quantifier Properties and Proofs (Optional) 


We begin this section with some logical rules showing how the quantifiers V and J interact 
with propositional operators such as A and V. After some informal discussion, we use the 
proof templates to prove these rules and some other properties stated in the last section. 
These proofs highlight the difference between how multiple quantifiers are treated in known 
statements vs. statements to be proved. The material presented here is a bit more abstract 
and challenging compared to earlier sections, so I have designated this section as optional. 
But it is a great opportunity to understand, at a deeper level, the nuances of how quantifiers 
and proof templates work. 


How Quantifiers Interact with Propositional Operators 


We have remarked that the universal quantifier V can be viewed as a generalization of the 
logical AND operator A, whereas the existential quantifier 4 generalizes the OR operator 
V. This observation may give some algebraic intuition for the following result. 


2.73. Theorem. For all open sentences P(x) and Q(x) and all universes U: 
(a) Va € U,[P(xz) AQ(z)] & (Wye U,P(y)) A (Yz € U, Q(z). 
(b) 3x € U,[P(x) VQ(a)] S&S (Ay EU, Ply)) v (Ez € U, Q(z)). 


We give an intuitive explanation for this theorem here, followed by a formal proof later 
in this section. The universal statement on the left side of (a) asserts that every object c 
in U makes both P(c) and Q(c) true. The right side of statement (a) is an AND-statement 
asserting that, first, every object c in U makes P(c) true; and second, every object c in U 
makes Q(c) true. Intuitively, the two sides of statement (a) mean the same thing, so they 
should be logically equivalent. Statement (b) has a similar intuitive justification: the left 
side says that at least one object c in U makes P(c) true or Q(c) true; the right side says 
that either some object c in U makes P(c) true, or some object c in U makes Q(c) true. 

Next we present two more subtle rules, involving the interaction of V and V on one hand, 
and the interaction of 3 and A on the other hand. 


2.74. Theorem. For all open sentences P(x) and Q(x) and all universes U: 

(a) [Wy € U, P(y)) V Vz €U,Q(z))]| = Wa € U, (P(x) v Q(2))]. 

(b) Bz E€ U, (P(£) ^ Q(2)] = [Gy € U, P(y)) A Gz € U, Q(2))]. 

Warning: The converse of (a) is NOT always true; the converse of (b) is NOT always true. 


Let us begin with specific examples illustrating the failure of the converse statements. 
For statement (a), let U be the set of all humans, let P(y) mean “y is female,” and let Q(z) 
mean “z is male.” The statement Vy € U, P(y) says that every human is female, which is 
false. The statement Vz € U,Q(z) says that every human is male, which is also false. So 
the OR-statement appearing as the hypothesis of statement (a) is false in this example. On 
the other hand, for any fixed, particular human zo, the statement that “xo is female or xo 
is male” is true, so the universal statement appearing as the conclusion of statement (a) is 
true in this example. So, statement (a) is indeed true (as F > T is true), but the converse 
of statement (a) is false in this example (as T => F is false). 

For statement (b), we give a more mathematical example. Let U = Z, let P(x) mean 
“x > 0,” and let Q(x) mean “x < 0.” Does there exist at least one fixed integer x) making the 
statement “zo > 0 and zo < 0” true? The answer is no, so the hypothesis of statement (b) 
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is false in this example. On the other hand, consider the AND-statement in the conclusion 
of statement (b). Does there exist an integer yg such that yo > 0? Yes, for example, yo = 1. 
So the first half of the AND-statement is true. Does there exist an integer zp such that 
zo < 0? Yes, for example, zo = —1. So the second half of the AND-statement is true. Thus, 
statement (b) has a true conclusion. We see now that statement (b) is true but its converse 
is false, in this example. 

We now give an informal justification of Theorem 2.74; see below for a formal proof. 
Intuitively, the hypothesis of (a) demands that one of two very strong conditions hold: 
either every object in the universe satisfies P, or every object in the universe satisfies Q. 
The conclusion of statement (a) is the much weaker assertion that for each individual object, 
considered one at a time, that object satisfies either P or Q. Which one of the two properties 
is true can vary from object to object. 

Here is intuition for statement (b). The hypothesis of statement (b) is the rather strong 
statement that we can find one particular object in U that simultaneously satisfies both P 
and Q. The conclusion of statement (b) only asserts that some object in U satisfies P, and 
some (possibly different) object in U satisfies Q. 


Proofs of the Quantifier Interaction Properties 


We now use the proof templates to generate formal proofs of Theorem 2.73 and 2.74. Recall 
that letters with subscripts (like zo) denote constants, whereas the corresponding letter 
without a subscript is a quantified variable. Throughout these proofs, we let P(x) and Q(z) 
be fixed, but arbitrary, open sentences, and let U be a fixed universe. [Note that we have just 
used the generic-element proof template three times to deal with the universal quantifiers 
that begin the theorem statements.| 


2.75. Proof of Va €U,[P(x)\ Q(x) = (Vy Ee U,P(y)) A (Yz € U, Q(2)). 
[This is an IFF-statement, so we need a two-part proof.] 


Part 1. Assume Vz € U, [P(x) A Q(az)]. Prove (Vy € U, P(y)) A (Vz € U, Q(z)). 
[The new goal is an AND-statement, so we prove each part separately.] 

First, prove Vy € U, P(y). Fix an arbitrary object yo E€ U. Prove P(yo). To do so, take 
Xx = yo in the assumed universal statement (we are using the Inference Rule for ALL here). 
The assumption tells us that P(yo) A Q(yo) is true. By definition of AND, P(yo) is true, as 
needed. 

Second, prove Vz € U,Q(z). Fix an arbitrary object zọ € U. Prove Q(z). To do so, 
take x = z in the assumed universal statement (using the Inference Rule for ALL). The 
assumption tells us that P(z.) A Q(zo) is true. By definition of AND, Q(zq) is true, as 
needed. Part 1 is now complete. 


Part 2. Assume (Vy € U, P(y)) A (Vz € U, Q(z)). Prove Vz € U, [P(x) A Q(2)]. 

[The goal of part 2 is a universal statement, so we use the generic-element proof template.] 
Let xo be a fixed, but arbitrary element of U. We must prove P(xo) A Q(xo). To prove this 
AND-statement, we first prove P(g), and then prove Q(x). Note that the AND-statement 
we have assumed tells us that Vy € U, P(y) is true, and Vz € U, Q(z) is true. Taking y = £o 
in the first of these statements, we see that P(xo) is true by the Inference Rule for ALL. 
Taking z = xp in the second of these statements, we see that Q(zo) is true by the Inference 
Rule for ALL. Part 2 is now complete. 


2.76. Proof of Jx € U, [P(x)v Q(x) = (ay € U,P(y)) v Gz €U,Q(z)). 
We could prove this statement from first principles, but it is quicker to use the statement 
we have just proved, namely Theorem 2.73(a). Since this statement was proved for all open 
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sentences P(x) and Q(x), we can replace P(x) by ~P(x) and Q(x) by ~Q(z) in part (a). 
We see that we already know this fact: 


Va € U,[(~P(x)) A (~Q(x))] = Wy € U, ~P(y)) A (Yz € U, ~Q(2)]. 


Recall the logical equivalence A @ B = (~A) © (~B). We can use this equivalence to 
transform the fact written above by negating the statements on the left and right sides of 
the & symbol. Using the denial rules to further simplify each side, we eventually get 


dr € U,|P(@)VQ(@)| & (3y € U, Ply) V Gz € U, Q(2)), 


which is precisely the result we needed to prove. 


2.77. Proof of [(Vy € U, P(y)) v (Yz €U,Q(z))] = [Mz E U, (P(x) v Q(x))]. 

[We give a direct proof of the IF-statement.] 

Assume (Vy € U, P(y)) V (Yz € U, Q(z)). Prove Va € U, (P(x) V Q(x)). 

[We have just assumed an OR-statement, so use proof by cases.] 

Case 1. Assume Vy € U, P(y). Prove Va € U, (P(x) V Q(x)). Let £o be a fixed, but arbitrary 
element of U. Prove P(xo) V Q(xo). Taking y = xo in the assumption, the Inference Rule 
for ALL tells us that P(xo) is true. Now, by definition of OR, “P(xo) V Q(xo)” must also 
be true. 

Case 2. Assume Vz € U, Q(z). Prove Vx € U, (P(x) V Q(x)). Let £o be a fixed, but arbitrary 
element of U. Prove P(xzo) V Q(ao). Taking z = zo in the assumption, the Inference Rule 
for ALL tells us that Q(x) is true. Now, by definition of OR, “P(xo) V Q(xo)” must also 
be true. 

So the needed conclusion holds in all cases. 


We could deduce Theorem 2.74(b) by taking the contrapositive of what we just proved 
and simplifying (see the exercises), but proving part (b) from scratch is also instructive. 


2.78. Proof of [Az € U, (P(x) AQ(z))] = [Gy € U, Ply)) A (Az € U, Q(z))). 
Assume Ja € U, P(x) A Q(x). Our assumption means there is at least one constant object 
Xo in U such that P(x) A Q(z) is true [we are using existential instantiation (EI) to give a 
name Zo to the object whose existence is assumed; see page 59]. Prove (Sy € U, P(y))A(az € 
U, Q(z)). [To prove an AND-statement, we prove each separate part.] We must first prove 
dy € U, P(y). Choose yo = Xo, which is in U. By definition of AND, P(a) is true. We must 
next prove dz € U,Q(z). Choose zo = zo, which is in U. By definition of AND, Q(zxo) is 
true. 


The reader is invited to attempt to prove the converse of each statement in Theorem 2.74 
for general open sentences P(x) and Q(z), to see exactly where the proof templates get stuck. 


Proofs of Quantifier Reordering Rules 


This section provides proofs of the rules for reordering quantifiers, which were stated in the 
previous section. Throughout this section, we let x and y be arbitrary distinct variables, 
and we let P(x,y) be a fixed, but arbitrary, open sentence. 


2.79. Proof of Vz,Vy,P(z,y) <= Vy,Vx,P(2,y). 

[This is an IFF-statement, so we do a two-part proof.] 

Part 1. Assume Vz, Vy, P(x,y). Prove Vy, Vx, P(x,y). 

[The new goal is a universal statement, so use the generic-element template. 

Let yo be a fixed, but arbitrary object. Let xo be a fixed, but arbitrary object. Prove 
P(2o, yo) is true. [To continue, note that the assumption is a known universal statement, so 
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we can use the Inference Rule for ALL, abbreviated IRA.] In the assumption, take x to be 
the object zo to deduce (by IRA) Vy, P(ao, y). Then take y to be the object yo to deduce 
(by IRA) that P(zo, yo) is true. This completes part 1. We have now proved 


Vx, Vy, P(x,y) = Vy,Va,P(z,y). 


Part 2. [This part is almost identical to part 1, so we let you fill in the details. Alternatively, 
the IF-statement to be proved in part 2 follows from the IF-statement already proved in 
part 1 by interchanging x and y, after replacing P(x, y) by P(y,2).] 


In the exercises below, we ask you to prove 


dae, Jy, P(x,y) <= Jy, 3x, P(x,y) (2.5) 


in two ways. 


2.80. Proof of [Az,Vy, P(z,y)| => Wy, 3x, P(x,y). 

Assume Jz, Vy, P(x,y). Prove Vy, da, P(x,y). 

By Existential Instantiation (EI), the assumption tells us there is some fixed object xo that 
makes Vy, P(xo,y) true. To begin proving the new goal, let yo be a fixed, but arbitrary 
object. We must show 3x, P(x, yo). We choose x here to be £o; so now we must show 
P(xo, yo) is true. This follows by applying the Inference Rule for ALL to Vy, P(2o,y), 
letting y be the object yo. 


To help understand what happened in that last proof, let’s try to prove the converse 
and see what goes wrong. 


2.81. Attempted Proof of [ẹVy, 3x, P(x,y) = [Aa,Vy, P(x, y))]. 

Assume Vy, Ja, P(x,y). Prove dx, Vy, P(x,y). [Here, at the very beginning of the proof, we 
are required to choose a specific object 2) making Vy, P(zo, y) true. What are we going 
to choose? The assumption cannot help us unless we can figure out a helpful constant to 
substitute for y using the Inference Rule for ALL (IRA). Suppose we blindly try y = yo, 
where yo is some particular object. By IRA and then EI, the assumption does give us an 
object xo making P(xo, yo) true for this particular yo. But now, to prove Vy, P(xo,y), we 
need to fix a new arbitrary object y1, which need not equal the fixed object yo selected 
earlier in the proof. So there is no obvious way to complete the proof. We have already seen 
examples where the original statement fails, so we will not be able to find a proof. 


The moral of this section is that the relative ordering of quantifiers and other logical 
symbols often has a big impact on the meaning of the statement. So great care is needed 
when working with a statement containing multiple quantifiers. 


Section Summary 


1. Quantifier Rules. The following rules hold for all open sentences P(x) and Q(x): 


Va € U,[P(x) A Q(2)] < (Yy € U, Ply)) A (Yz € U, Q(2)). 
Jx € U, [P(x) v Q(x)] S (3y € U, Ply)) v (Az € U, Q(2)). 

(vy € U, P(y)) V (Wz €U,Q(z))]) = [Mz €U, (P(z) v Q(x))] 
Be € U,(P(z)AQ(z))]) = [By € U, P(y)) A Gz € U,Q(z))]. 


In the last two rules, there exist open sentences for which the converse statements 
are false. 
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2. Proofs Involving Multiple Quantifiers. When proving complex statements involv- 
ing multiple quantifiers, process the quantifiers in the order they are encountered 
using the proof templates for J and Y. To use known statements involving quan- 
tifiers, carefully apply Existential Instantiation (EI) and the Inference Rule for 
ALL (IRA). Be sure to distinguish quantified variables from constants and note 
whether constants are arbitrary or chosen by the proof writer. 


Exercises 


1. Is each statement true or false? Explain each answer. 
a) Yy E R,y > 0. 
b) Vz ER,z <0. 
c) (Vy E€ R, y > 0) S (Vz E R,z < 0). 
d) zo > 0 & zo < 0, where zg is a fixed real number. 
e) Va ER, (a >0Sa<0). 
f) For all open sentences P(x) and Q(x), 
Ve E€ R, (P(x) S Q(z)) iff (Vy eR, P(y)) e (Vz ER, Q(z)). 
2. Deduce part (b) of Theorem 2.74 from part (a) by taking the contrapositive and 
using the denial rules. 


3. Use proof templates to prove part (b) of Theorem 2.73 without using part (a). 


4. (a) Prove (2.5) as a consequence of the analogous result for reordering universal 
quantifiers. (b) Prove (2.5) directly, using proof templates for existential state- 
ments and existential instantiation. 


5. Prove: for all nonempty universes U and all open sentences P(x), 


[V2 €U,P(2)| => [Ax €U, P(x). 
6. (a) Prove: for all universes U and all open sentences P(x) and Q(x), 
[Va € U, (P(t) > Q()] = [Vy € U, P(y)) > (Yz € U, Q(z))). 
(b) Disprove: for all universes U and all open sentences P(x) and Q(x), 
(vy € U, P(y)) > (Wz EU,Q(z))) = Wr EU, (P(@) > Q(2))). 
7. Use part (a) of the previous problem and other known theorems to prove: 


Ve E€ U, (P(t) @ Qz))) = [Vy € U, P(y)) e (Yz € U, Q(z))). 


8. Try to prove the converse of each statement in Theorem 2.74 using the proof 
templates. Explain what goes wrong. 

9. Consider the two statements: 

i) Sr € U, (P(<) Q(z) Gi) Gy € U, P(y)) @ Gz € U, Q(2)) 

a) Does (i) imply (ii) in all situations? Prove your answer. 

b) Does (ii) imply (i) in all situations? Prove your answer. 


10. Consider the two statements: 

i) Va € U, (P(x) © Q(2)) (ii) (Yy € U, Ply) © (Yz € U, Q(z) 
a) Does (i) imply (ii) in all situations? Prove your answer. 

b) Does (ii) imply (i) in all situations? Prove your answer. 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 
21. 
22. 
23. 
24. 
25. 


Consider the two statements: 

i) dx € U, (P(x) @ Q(x)) (ii) Gy € U, P(y)) © (3z € U, Q(z) 
a) Does (i) imply (ii) in all situations? Prove your answer. 

b) Does (ii) imply (i) in all situations? Prove your answer. 


Consider the two statements: 

i) Ix € U, (Pe) Q(x) (it) Gy € U, Py) + Gz €U,Q(2) 
a) Does (i) imply (ii) in all situations? Prove your answer. 

b) Does (ii) imply (i) in all situations? Prove your answer. 


a) Prove: for all propositions P and all open sentences Q(x), 

P > Yx, Q(<£)) & Va,(P > Q(2)). 

b) Disprove: for all propositions Q and all open sentences P(x), 

Yr, P(t) >Q © Vz2,(P(x) > Q). 

c) Can you find a way to modify the right half of (b) to produce a true statement? 
If so, prove your answer. 


Fix an arbitrary proposition P and an arbitrary open sentence Q(x). Prove: 


(a) [Va,(PAQ(a))] @ PAWy, QU). 

(b) V2, (PV Q(@))] @ Pv My, Ql). 

Fix an arbitrary proposition P and an arbitrary open sentence Q(x). Prove: 
(a) Er, (PAQ(a))] = PA By, Qu). 

(b) Bz, (Pv Q()] @ PV By, Q). 

Do this directly, or by using the previous exercise. 


Let U be a fixed, nonempty universe. Consider the two statements: 
i) Bx € U, (P@ Q(2))] (i) P® By €U,Qiy)]. 

a) Does (i) imply (ii) in all situations? Prove your answer. 

b) Does (ii) imply (i) in all situations? Prove your answer. 


Let U be a fixed, nonempty universe. Consider the two statements: 
i) We E€ U, (P & Q(@))] Gi) P & Wy € U, Q(y)]. 

a) Does (i) imply (ii) in all situations? Prove your answer. 

b) Does (ii) imply (i) in all situations? Prove your answer. 


Prove: for all nonempty universes U and all open sentences P(x), 
Jy € U, (P(y) > Vz € U, P(x)). 


Prove: for all nonempty universes U and all open sentences P(x), 
Vy € U, (P(y) = Jx € U, P(x)). 


Does 3! x, Yy, P(x,y) always imply Yy, 4! x, P(x,y)? Explain. 


Does 4! x, (P(x) V Q(x)) always imply [3 !y, P(y)] v [3! z, Q(z)]? Explain. 
Does [3 !y, P(y)] v [A! 2, Q(z)] always imply 4! x, (P(x) V Q(x))? Explain. 
Does 4! x, (P(x) A Q(x)) always imply [A! y, P(y)] A [3! z, Q(z)]? Explain. 


Does 4! 2,4! y, P(x,y) always imply 4! y,4! a, P(x,y)? Explain. 
y y y 3:9 


Which of the answers to the previous five problems change if every 4! is replaced 
by 3 in the conclusions? 
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Logical Symbols 


Complicated propositions are built up from simpler ones using the following symbols. 


Formal Expression | English Translation 


P is not true. 

P and Q. 

Por Q. 

If P, then Q. 

P if and only if Q. 

P or Q, but not both. 

For every xo in U, P(xo) is true. 


There exists at least one xo in U for which P(xoọ) is true. 
There exists exactly one x in U for which P(ao) is true. 


The meaning of ~, A, V, >, ©, and © is given by the following truth tables. 


Ai H Ba) 
AHA AHO 


Possible translations of P > Q include: 


if P, then Q. 


P is sufficient for Q. Q is necessary for P. Q whenever P. (not P) or Q. 


NOT AND OR IF IFF XOR 
~P PAQ PVQ P=Q PsSQ PQ 
F T T T T F 
F F T F F T 
T F T T F T 
T F F T T F 
P implies Q. Q if P. P only if Q. 


In mathematics, OR always means inclusive-OR (V). 


Denial Rules 


To form a useful denial of a given statement, repeatedly apply the following rules. 


Statement Denial of Statement | Symbolic Version of Rule 
A and B (denial of A) or (denial of B) | ~(AA B) = (~A) V (~B) 
AorB (denial of A) and (denial of B) | ~(A V B) = (~A) A (~B) 
if A, then B A and (denial of B) ~(A > B)=AA(~B) 
not A A (vA) =A 


For all x, P(x) 


There is x, (denial of P(x)) Vea € U, P(x) iff Sa € U, ~P(x) 


Note: the useful denial of an IF-statement is an AND-statement, not an IF-statement. 


There is x, P(x) | For all x, (denial of P(z)) ~da € U, P(x) iff Vx € U, ~P(x) 
A if B A or B, not both (AS B)= (AGB) 
A or B, not both | A iff B (AG B)=(ASB) 
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Example. A useful denial of 
“Ye > 0,46 > 0,Yx ER, (2-5 < x < 2+8) 53 (4-€< a? <4+6)” is 
de > 0,V6 > 0,42 € R, (2-6 < x < 2 + ô) and (4 — € > x? or z? > 4+6)” 


Propositional Logic 


1. Filling in Truth Tables. Make a column for each subexpression of the propositional 
form. Fill in each column using the defining truth tables for the logical operators 
(given above). Be sure to refer to the correct previous columns. Recall NOT flips 
true and false; AND outputs true precisely when both inputs are true; OR outputs 
false precisely when both inputs are false; IF (i.e., A = B) outputs false precisely 
when A is true and B is false; IFF outputs true precisely when the inputs agree; 
XOR outputs true precisely when the inputs disagree. 


2. Logical Equivalence. Two propositional forms P and Q are logically equivalent, 
written P = Q, iff all rows of the truth tables for P and Q match. Some frequently 
used logical equivalences appear in the Theorem on Logical Equivalences, the 
Theorem on IF, and the Theorem on IFF. 


3. Tautologies and Contradictions. 
A propositional form is a tautology iff all rows of its truth table have output true. 
A propositional form is a contradiction iff all rows of its truth table have output 
false. Every proposition is true or false; but most propositional forms are neither 
tautologies nor contradictions. 


4. Terminology for IF-statements. 

a) A > B is called a conditional with hypothesis A and conclusion B. 

b) The converse of A => B is B > A. 

c) The contrapositive of A > B is (~B) => (~A). 

d) A denial of A => B is AA (~B). 

e) The inverse of A > B is (~A) > (~B). 

f) “If A then B” is logically equivalent to its contrapositive. 

g) “If A then B” is not logically equivalent to its converse, its inverse, or its 
negation (in general). 


5. Eliminating >, &, and ®. Use these logical equivalences to rewrite IF, IFF, and 
XOR in terms of other logical symbols: 


(a) ASB = (~A)VB. 
(b) A&B = (ASB)A(B=A) = (AAB)V((KA)A(~B)). 
(c) ASB = (AA(B))V((KA)AB) = (AVB)A((AAB)). 


6. Inference Rules for IF. If P > Q is a known theorem and P is a known theorem, 
you can deduce Q as a new theorem. If P > Q is a known theorem and ~Q is a 
known theorem, you can deduce ~P as a new theorem. 


Quantifiers 


Let U be a fixed universe (nonempty set of objects). 


1. Restricted and Unrestricted Quantifiers. For all open sentences P(x), 
Va €U,P(x) if Vx,(x €U => P(a)); 
da € U, P(x) if Ar, (ae UA P(a)). 
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2. Quantifiers Rules and Pitfalls. For all open sentences P(x), Q(x), and R(x, y): 


(a) Va, P(x) = Vu, P(v) where v is a new letter. 
(b) da, P(x) = dv, P(v) where v is a new letter. 
(c) Va, Vy, R(x, y) < Vy, Va, R(x, y). 

(d) dz,ay,R(z,y) <= 3y,3r, R(z,y). 

(e) Jx, Vy, R(x, y) > Vy, da, R(x, y). 

(f) Vz € U, [P(x) A Q(z) < (Vy € U, P(y)) A (Vz € U, Q(2)). 
(g) dr €U,[P(@)VQ@)) = (yeU,Ply)) V (Ez €U,Q(z)). 
(h) [Wy € U, Py) V Vz €U,Q(z))) = WreU, (P(x) v Q(@))). 

(i) [Are U, (P(x) AQ] = [By €U, Ply) A (Gz € U,Q(z))).- 


The right side of (e) says for all y, there exists an x (which can be chosen differ- 
ently for different y) such that R(x, y) is true. The left side of (e) is the stronger 
statement that there is one choice of x (the same choice for all y) such that R(x, y) 
is true. To see that (h) cannot be strengthened to an IFF-statement, let U be 
the set of humans, let P(x) mean “x is male,” and let Q(x) mean “x is female.” 
To see that (i) cannot be strengthened to an IFF-statement, let U = R, let P(x) 
mean “x <0,” and let Q(x) mean “x > 0.” 


3. Contrasting Translations. Let B(x) mean x is black; let C(x) mean zx is a crow. 
“No crows are black” translates to Vx, C(x) >~B(z). 
“Not all crows are black” translates to Vz, C(x) > B(x). 
We can transform this translation to Jx, C (x)^ ~B(a). 
“All crows are not black” translates to Vz, C(x) >~B(z). 
“Black crows do not exist” translates to ~dz, C(x) A B(x). 
We can transform this translation to Vz, C(x) >~B(x). 


4. Inference Rule for ALL. If “Yx € U, P(x)” is a known theorem and “c € U” isa 
known theorem (where c is any expression), then you can deduce the new theorem 
P(c). 

5. Multiple Quantifier Example. To prove Va, dy, Vz, P(x,y, z): fix xo; choose an ap- 
propriate yo (which may depend on 29); fix zo; prove P(x, yo, zo). To disprove 
Va, dy, Yz, P(a2,y, z), prove dx, Vy, 3z, ~P(x, y, z) as follows. Choose an appropri- 
ate specific object xo; fix yo; choose an appropriate zy (which may depend on zo 
and yo); prove ~P(2o, Yo, Zo): 


6. Rule to Eliminate the Uniqueness Symbol. 


g!x € UỌ,P(<) & (AreU,P(x))AVa €U,Vy € U, (P(x) A Ply)) > « = yl. 


Proof Templates 


General Comments: 
(a) Memorize the proof templates! Practice the proof templates! 


(b) To decide which template to apply at each stage, look for the outermost logical operator 
in the current statement to be proved. 

(c) In each template, the numbered statements (1), (2), ... should be explicitly written down 
at the appropriate point in the proof. 
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(d) After writing these statements, continue the proof by either using more proof templates 
or expanding definitions in what you have just written. 

(e) Remember to introduce each sentence, formula, and new variable with logical status 
words (“assume,” “we must prove,” “we know,” “choose,” etc.). 


(£) Do not reuse a variable to mean two things. 
(g) Do not deduce consequences from an equation (or statement) that is yet to be proved. 


Instead, work forward from known and assumed statements to gather information leading 
toward the statement to be proved. 


. Direct Proof of “if P then Q.” (1) Assume P. (2) Prove Q. 

2. Contrapositive Proof of “if P then Q.” (1) Assume ~Q. (2) Prove ~P. 
[Use the denial rules to find useful denials of P and Q here.] 

3. Contradiction Proof of “if P then Q.” (1) Assume, to get a contradiction, “P 
and ~Q.” (2) Deduce a contradiction. 

4. Contradiction Proof of Any Statement P. (1) Assume, to get a contradiction, ~P 
[write a useful denial of P here]. (2) Use this assumption, definitions, and known 
results to produce a contradiction (such as S A (~S)). (3) Conclude that P is 
true. 

5. Two-Part Proof of “P iff Q.” (1) Prove “if P then Q” by any method. (2) Prove 
“if Q then P” by any method. 

6. Constructive Proof of 3x € U, P(x). (1) Describe a specific object xp that will 
make P(x) true (we often say: “choose rp = ...”). (2) Prove a € U. (3) Prove 
P(x). Note zo is allowed to depend on constants introduced earlier in the proof. 

7. Generic-Element Proof of Vx € U,P(a). (1) Fix an arbitrary object xp € U. 
(2) Prove P(xo). (It is not correct to choose a specific object in U here; step (2) 
must work for generic elements of U.) 

8. Proof by Cases. To prove Q when you already know P; V P> V---V Px: (0) Say: 
“We know Pı or P or ... or Py, so use cases.” 

(1) Case 1. Assume P}. Prove Q. 
(2) Case 2. Assume Pz. Prove Q. 


(k) Case k. Assume Py. Prove Q. (Thus Q must be proved k times, where there 
is a different assumption each time.) 


9. Proving P AQ. (1) Prove P. (2) Prove Q. 
10. Proving PV Q. (1) Assume ~P. (2) Prove Q. 
11. Proving P®Q. Prove P & (~Q). 


12. Proving ~P. (1) Use the denial rules to find a useful denial of P not starting with 
NOT. (2) Prove the denial found in (1). 


13. Disproving P when P is False. Prove ~P (see previous item). 


14. Proof by Exhaustion. If U = {z1,...,2n} is a finite set, you can prove Yx € U, P(x) 
by proving “P(z1) and P(z2) and ... and P(zn).” 


Review Problems 


1. Complete the following definitions: (a) for a fixed integer c, c divides c? + 7 iff... 
(b) A real number r is rational iff... (c) A propositional form A is a contradiction 
iff... 
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2. 


10. 
11. 


12. 
13. 


14. 
15. 


Give a truth table for the propositional form ((~P) A Q) = (P v Q). Show all 
columns. Is this a tautology? 


Write seven different English phrases that could be used to translate P > Q. 


Let Q be the statement “If X is compact, then: X is separable or X is not closed.” 
(a) Write the converse of Q. (b) Write the contrapositive of Q. (c) Outline a direct 
proof of Q. (d) Outline a proof of Q by contradiction. 


True or false? 

a) 7 is odd or 7 is odd. 

b)14+1=2if0=1. 

c) 0 = 0 whenever 0 # 0. 

d) 5 € Z implies 0.37 € Z. 

) 0.5 is rational only if 8 is odd. 

) V2 ER is a necessary condition for v2 € Z. 

) 2 is odd iff 7 is even. 

) dia ER, (x? = 2). 

i) A useful denial of “4 < x < y” is “4 > x > y.” 

j) Every propositional form is either a tautology or a contradiction. 
k) The propositional form P @ P is a contradiction. 

1) (P A Q) V R is logically equivalent to PA (Q V R). 

m) Given that P > Q is a known theorem and Q is a known theorem, we may 
always deduce P as a new theorem. 

n) One way to prove P ®Q is to assume ~P and prove Q. 


a) Make truth tables for (P Q) > P and P & (Q > P). 
b) Are the two forms in (a) logically equivalent? Explain briefly. 
c) Is either form in (a) a tautology? Explain briefly. 


Let Q be the statement “If (X is compact and Hausdorff) or X is metrizable, 
then X is normal.” (a) Write the converse of Q. (b) Write the contrapositive of 
Q. (c) Outline a direct proof of Q. 

Consider the statement: (A A B) > (C ® D). 

(a) Outline a direct proof of this statement. 

(b) Outline a contrapositive proof of this statement. 

(c) Outline a proof by contradiction of this statement. 


Let W(x) mean “x is white,” R(x) mean “gx is a rose,” and P(x) mean “x is 
pretty.” (a) Using this notation, logical connectives, and unrestricted quantifiers, 
write this statement in symbolic form: “Although all white roses are pretty, some 
pretty roses are not white.” (b) Give a useful denial of the previous statement. 
Do not use symbols in your answer. 


Prove: for all x,y € R, if x € Q and y € Q and y £0, then z/y € Q. 


(a) Prove: For all integers x, if x? + 4 is even, then x is even. (b) Is the converse 
of (a) true? 


Prove: for all a,b,c,r,s € Z, if a divides b and a divides c, then a divides rb + sc. 


Outline a direct proof of this statement, assuming P, a, b are fixed objects: 
“if P is a prime ideal, then ab € P if a € P or be P?” 


(a) Prove: Yx € R, Jy € R,y > x + 2. (b) Disprove: dy € R, Vx E€ R, y > a +2. 


(a) Prove: Va € Ryo, dy € Rao, a < y < 2x. 
(b) Disprove: dy € Ryo, Va E€ Rso, x < y < 2z. 
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16. 
Vt: 


18. 


19. 


20. 
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Prove: for all r € R, r is rational iff 3r + 5 is rational. 


Disprove: for all a,b,c,r,s € Z, if a divides rb + sc, then a divides b or a divides 
c or a divides r or a divides s. 


(a) Prove: for all a,b,c € Z, if a divides b or a divides c, then a divides bc?. 
(b) Prove or disprove the converse of (a). 


Show that P > (Q A^ R) is logically equivalent to (P > Q) A (P = R) using 
known logical equivalences, not truth tables. 


Give a proof by contradiction of this statement: 
For all b,c € Z, if bc? is even, then b is even or c is even. 


Sets 


3.1 Set Operations and Subset Proofs 


At this point, we have introduced nearly all of the fundamental rules for generating proofs 
of logical statements. (The main rule yet to be discussed is mathematical induction, which 
we cover in Chapter 4.) Armed with our knowledge of logic and proofs, we can now proceed 
to study whatever mathematical theories we are interested in: set theory, number theory, 
graph theory, abstract algebra, analysis, probability theory, complexity theory, and so on. 
Surprisingly, the theory of sets turns out to be a foundational subject on which every other 
branch of mathematics can be built. Many aspects of set theory relate very closely to the 
propositional logic and quantifier logic that we have just studied. So, we continue our journey 
up the tower of mathematics by examining elementary set theory. 


Informal Introduction to Set Theory 


Most readers of this text have already seen some aspects of set theory at an intuitive level. 
We begin by introducing the basic ideas intuitively, to motivate the formal definitions given 
below. Informally, a set is any collection of objects. For example, {a, b,c, d} is a set of four 
letters; Z is the set of all integers; [0,3] is the set of all real numbers x satisfying 0 < x < 3, 
and so on. If x is an object and S is a set, we write x € S to mean that x is a member of 
S; x S means that x is not a member of S. For example, 3 € Z, b € {a,b,c,d}, 5 ¢ [0,3], 
and so on. (We already used this notation in our discussion of restricted quantifiers.) The 
symbol x € S can be read aloud as “x is a member of S” or “x belongs to S” or “x is in 
S.” Beginners sometimes read the symbol x € S' as “x exists in S,” but this phrase should 
not be used, as it suggests an existential quantifier that is not present. 

The members of a set can be any objects whatsoever, including other sets. For example, 
{0, 1, 2,3, Z, [0, 3]} is a set with six members: the four integers 0, 1, 2, and 3; the set Z of all 
integers; and the closed interval [0, 3], which is a set of real numbers. The following diagram, 
called a Venn diagram, shows two geometric sets A and B; the members of A are the points 
on this page lying within the left circle. 


A B 


We can combine sets to form new sets using various set operations. Let A and B be 
any two sets. The union of A and B, denoted AU B, is formed by lumping together all the 
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members of A and B into one big set. Symbolically, AU B = {a : a € A or x € B}. For 
example, {1, 4,5, 7}U {2, 3,5, 7,11} = {1, 2,3, 4,5, 7,11}. For A and B in the Venn diagram 
above, AU B is the shaded region below consisting of all points lying within at least one of 
the two circles: 


The intersection of A and B, denoted AN B, consists of all objects that are members 
of both A and B. Symbolically, AN B = {a : a € A and «x € B}. Continuing our previous 
example, {1,4,5,7} {2, 3,5, 7,11} = {5,7}. The Venn diagram for AN B looks like this: 


S 


The set difference A — B is the set of all members of A that are not members of B. 
Symbolically, A — B = {x : x € A and z ¢ B}. For example, {1,4,5,7} — {2,3,5,7,11} = 
{1,4} and {2,3,5,7,11} — {1,4,5,7} = {2,3,11}. Here is the Venn diagram for A — B: 


We say that A is a subset of B, written A C B, to mean that every member of A 
is also a member of B. The notation A Z B means that A is not a subset of B. For 
example, {1,4,5,7} Z {2,3,5,7,11} since 1 is a member of {1, 4,5, 7} that is not a member 
of {2,3,5,7,11}. On the other hand, {1,4,5,7} C {1,2,3,4,5,6,7}, {1,4,5,7} C Z, and 
Z C Q CR. In the Venn diagrams displayed above, A Z B and B Z A, since each of 
the two circles contains some points not in the other circle. In the following Venn diagram, 
CCDbut DZC: 


© 
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It is possible for a set to have no members whatsoever. Such a set is called an empty 
set, and it is denoted by the symbol Ø or {} (a set of curly braces with nothing inside). Be 
warned that these two notations for the empty set must not be combined: {Ø} is a set with 
one member (namely, the empty set), so {Ø} is not the same set as the empty set. We give 
a more detailed discussion of this point later, after introducing the appropriate definitions. 


Formal Definitions of Subsets and Set Operations 


We now begin afresh with a formal development of set theory, which is motivated by the 
informal discussion in the previous section. In our theory, the following words and symbols 
are undefined terms (see §2.1): set, membership in a set, x € S. Every other concept in 
set theory will be defined from these undefined terms and the logical operators discussed 
earlier. We begin with the formal definition of subsets. 


3.1. Definition: Subsets. For any sets A and B,| AC Byjiff}Vz,2e AS>ae Bl. 


The sentence in this definition is a new definitional axiom that lets us replace the newly 
defined symbol A C B (“A is a subset of B”) by the definition text appearing after the 
word “iff,” and vice versa. Since P © Q is equivalent to (~P) = (~Q), we can obtain a 
definition of A Z B by applying the denial rules. We get: 


For any sets A and B,|A Z Bif Jr, ce AAxr¢ Bi. 


Similarly, we can deny both sides of all definitions given below to find out what the denial 
of a newly defined term means. 

The following variants of the subset notation are sometimes needed. For all sets A and B, 
we say A is a proper subset of B, written | A Ç B |, iff} AC B and AF B |. Do not confuse 
this notation with A Z B, which means that A is not a subset of B. We can also reverse 
the order of A and B. We say B is a superset of A, written | B D A|, iff} A C B |. Similarly, 


B is a proper superset of A, written | BD A| iff} AC B.. 

Next we define union, intersection, set difference, and the empty set. These definitions 
tacitly require the following idea: to define a new set, we must specify exactly which objects 
are members of that set. (We develop this idea more formally in the next section, when we 
define set equality.) So, for example, to define the new set AU B, we need an axiom telling 
us precisely which objects belong to this set. Here is the definitional axiom we need. 


3.2. Definition: Union of Two Sets. For all sets A and B and all objects x: 


xe AUB)if|xcAorzre B. 


3.3. Definition: Intersection of Two Sets. For all sets A and B and all objects x: 
xe ANB|iff|x € Aandzre B. 


Restating these definitions more symbolically, xv € AU B means (x € A) V (x € B), 
whereas x € AN B means (x € A) A (a € B). Observe that U (the union symbol) and V 
(the OR symbol) both open upwards, whereas N (the intersection symbol) and A (the AND 
symbol) both open downwards. This observation may help avoid confusing the two symbols 
U and N. We should also remember that U and V are not the same symbol; U is used to 
combine sets, whereas V is used to combine logical statements. Similar comments apply to 
N and A. 


3.4. Definition: Set Difference. For all sets A and B and all objects x: 
xe A—B|iff xe Aand rz ZB. 


3.5. Definition: the Empty Set Q. | For all objects x, x ¢ @. 
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Subset Proofs 


Let A and B be given sets. How can we prove that A C B? By definition, we must prove 
Va,c € A => x € B. We can obtain a proof outline of this statement by combining the 
generic-element proof template with the direct proof template for IF-statements. The result- 
ing sequence of steps occurs so frequently in proofs that it is worth stating (and memorizing) 
as its own separate proof template: 


3.6. Subset Proof Template to Prove A C B. (A and B are sets.) 

Fix an arbitrary object xj. Assume x € A. Prove xp € B. 

(Continue the proof by expanding the definitions of “ro € A” and “zo € B” and using more 
proof templates.) 


Note that A and B can be any expressions standing for sets, not necessarily individual 
variables, as seen in the next example. 


3.7. Example. Let U and V be sets. Outline a proof that U x V C P(P(UUUV)). 
Solution. Although the given statement uses potentially unfamiliar symbols (to be discussed 
later), we note that the statement is asserting that one set is a subset of another. So we can 
generate the proof outline immediately: 

Fix an arbitrary object vp. Assume zo € U x V. Prove xo € P(P(U UV)). 


The next theorem lists a host of properties of the subset relation. 


3.8. Theorem on Subsets. For all sets A, B, C: 
(a) Reflexivity: A C A. 

(b) Transitivity: If AC B and BCC, then ACC. 

(c) Lower Bound: ANB C A and ANBCB. 

(d) Greatest Lower Bound: C C AN B iff (C C A and C C B). 
(e) Upper Bound: A C AU B and B C AUB. 
(£) 
( 
( 
( 


g) Minimality of Empty Set: 0 C A. 

h) Difference Property: A— BC A. 

i) Monotonicity: If AC B, then: ANC C BNC and AUC C BUC and A~-CC B-C. 
(j) Inclusion Reversal: If A C B, then C — BC C — A. 


) 
f) Least Upper Bound: AU B C C iff (A C C and BCC). 
) 


We now use the subset template, combined with other proof templates and definitions, 
to prove some parts of this theorem. We ask the reader to prove the other parts in later 
exercises. Throughout all these proofs, we let A, B, and C be fixed, but arbitrary, sets. 


3.9. Proof of Reflexivity. We must prove A C A. Fix an arbitrary object xo. Assume 
zo € A. Prove zo € A. Since we just assumed that xp € A, the proof is done. 


3.10. Proof of Upper Bound. We must prove “A C AU B and B C AU B.” To prove 
the AND-statement, we prove each part separately. 

Part 1. Prove A C AUB. Fix an arbitrary object rp. Assume zo € A. Prove rp E€ AUB. 
We must prove “zo E€ A or x € B.” Since zo € A was assumed to be true, the truth table 
for OR shows that the required OR-statement is also true. 

Part 2. Prove B C AUB. Fix an arbitrary object yo. Assume yo € B. Prove yo € AUB. 
We must prove “yo € A or yo € B.” Since yo € B was assumed to be true, the truth table 
for OR shows that the required OR-statement is also true. 
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3.11. Proof of Minimality of Empty Set. We must prove Ø C A. [If we mechanically 
recite the subset proof template, we generate these lines: “Fix xp. Assume 2g € Ø. Prove 
zo € A.” It may be unclear how to proceed from this point. So we try the proof again, 
starting from the definition of subsets.] We must prove Vz,x € Ø > x € A. Fix an arbitrary 
object xo. We must prove the IF-statement “ro € Ø > zo € A.” Now, the hypothesis 
“zo € Ø” is false by definition of the empty set. So the entire IF-statement is true, by the 
truth table for IF. Thus we have proved Ø C A. 

You may find the proof just given to be somewhat cryptic. If so, you may prefer the 
following contrapositive proof of the IF-statement. Assume zo ¢ A; prove xo Z Ø. The goal 
is true by definition of the empty set [and we do not need to use the assumption here]. 
Similarly, you can prove Ø C A by a contradiction proof (see Exercise 10). This example 
illustrates the following general observation. It is often easier to prove statements involving 
the empty set using negative logic, such as a contrapositive proof or a proof by contradiction. 


We continue proving parts of the Theorem on Subsets in the next section. 


Section Summary 


1. Subsets.| A C B| means |Vxr,z E AS>xre BI. 
To prove A C B: fix an arbitrary object 79; assume xo € A; prove x € B. 


2. Set Operations. Memorize these definitions: for all sets A, B and all objects x, 


Union:|xe€ AUB/iff|xae Aorzre B\. 


Intersection: |x € AN B\iff}x’e Aandze B\. 


Set Difference: |x € A— B/iff}uec Aandz ¢ B). 


Empty Set: |Vx,x g |. 


3. Theorem on Subsets. Memorize the facts listed in the Theorem on Subsets for 
later use. In particular, for all sets A, B, C: Ø and A— B and A are always subsets 
of A; AC BCC implies A C C; AN B is a subset of both A and B; CC ANB 
if CC A and C C B; A and B are both subsets of AUB; AUBCCif ACC 
and B C C; and AC B implies ANC C BNC, AUC C BUC, A-CCB-C, 
andC—-BCC-A. 


Exercises 
1. Let X = {1,2,3,4,5,6, 7,8,9,10}, A = {1,3,5,7,9}, B = {1, 2,3,4,5}, and C = 
{2,4,6,8,10}. Informally compute the following sets: (a) AU B, ANB, ANC, 
(AUB)NC, AU(BNC). (b) B-A, A-B, B-C, C B, A C, C — A, 
B-B. (c) X-A, X-B, A- X, X- (ANB), (KAU = B), (X — A) SB, 
—- (A-B). 


2. In the previous exercise, consider the following ten sets: 0, AN B, BNC, A, B, 
C, AU B, BUC, X — A, X. Make a 10 x 10 chart showing, for each pair S, T 
of sets from this list, whether S C T. 


3. Let S = {0,3,5,9,0.6,7}, T = {4,5,7,3/5, V2}, and U = {V2, v4, V9, v0.36}. 
Informally compute the following sets: (a) SAT, SAU, TAU, SNTNU. (b) SOZ, 
TAQ, UNZ, S-Q, U- Q, (U-S)-T,U-(S-—T). 
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4. 


10. 


12: 


13. 


14. 


15. 


16. 
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In the Venn diagram shown below, express each of the seven numbered regions in 
terms of the sets A, B, and C. For example, region 1 can be written (A — B) —C 
or A— (BUC). 


C 


Make copies of the Venn diagram shown above with the following regions shaded. 
(a) A- (B-C) (b) (B— A)NC (c) AU (B-C) (d) (AUC) -—B (e) (AUB) NAC 
(f) AU(BNC). 

Draw a Venn diagram showing four sets A, B, C, D with the following properties: 
ANC =; ANB 404 BNC; D C B—A. Then shade in the set (CUD)—(CND). 


Draw a Venn diagram with five sets where the intersection of any two of the sets 
is not empty, but the intersection of any three of the sets is empty. 


Create a proof template for each situation below. 

(a) Prove A C B by expanding the definition and using a contrapositive proof. 
(b) Prove A C B using proof by contradiction. 

(c) Disprove A C B. 


Prove parts (c) and (h) of the Theorem on Subsets: for all sets A, B, ANBCA 
and ANBC Band A-—BCA. 


Fix an arbitrary set A. Prove Ø C A using proof by contradiction. 


Disprove each statement. 

(a) For all sets A and B, AUBCB. 

(b) For all sets A and B, AC A-B. 

(c) For all sets A and B, if AC B then C- AC C-B. 


Suppose we replace “For all” by “There exists” in each part of the previous 
exercise. Prove that the resulting statements are all true. 


Draw Venn diagrams illustrating (as best you can) the following items from the 
Theorem on Subsets: (a) lower bound; (b) difference property; (c) least upper 
bound; (d) monotonicity. 


Outline the proof of each statement; do not expand unfamiliar definitions. 
(a) For all sets S, Int(S) C Clo(S). 

(b) For all sets F and S, if F is closed and F D S, then F D Clo(S). 

(c) For all sets S and all z, if a € Int(S), then Ir € Rso, B(a;r) C S. 


Prove this part of the Theorem on Subsets: for all sets A, B, if A C B, then 
ANCCBNCandA-CCB-C. 

Give a specific example of sets A, B, C, D, E where AC C, BC C, C C D, and 
C C E are all true, but AC B, BC A, DC E, and E C D are all false. 
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17. (a) Prove: for all sets A, B, C, D,if AC Band BC CandC CDan DCA 
then C C B. (b) Suppose we omit the hypothesis A C B in (a). Prove or disprove 
the resulting statement. (c) Suppose we omit the hypothesis B C C in (a). Prove 
or disprove the resulting statement. 

18. Ten sets created from sets A, B, and C are listed below. Make a 10 x 10 chart 
showing, for each pair of sets S,T from this list, whether S C T must be true for 
all choices of A, B, and C. 


(AN B)NC, (ANB)—C, (AN B)UC, (AUB)NC, (AUB) —C, (AUB)UC, 


(A—B)UC, (A— B)NC, (A—B)-C, A- (B-C). 


19. For each pair of sets S, T listed in the previous exercise, decide whether $1 T 
must be the empty set for all choices of A, B, and C. 


110 An Introduction to Mathematical Proofs 


3.2 Subset Proofs and Set Equality Proofs 


This section continues our discussion of subset proofs, as well as the related idea of using 
a known statement A C B to prove other facts. We then define set equality and look at 
various ways to prove that two sets are equal. 


Subset Proofs, Continued 


In the last section, we described how to prove statements of the form A C B. Recall the 
main template used for such proofs goes like this: 


Fix rp. Assume 7% € A. Prove xp € B. 


Typically, the proof continues by expanding the definitions of “xp € A” and “zo € B.” 

Sometimes, instead of proving a statement of the form A C B, we already know that 
this statement is true. The following rule, which we might call a knowledge template, tells 
us how the known statement A C B can be used to deduce further conclusions. 


3.12. When A C B is Already Known or Assumed: 
(a) If you know that zo € A, you may deduce that zo € B. 
(b) If you know that xo ¢ B, you may deduce that xo ¢ A. 


To justify this, recall that A C B means Yz, x € A => x € B. By the Inference Rule for 
ALL, we may take x = xo here to deduce that “ro E€ A > zo € B” is true. On one hand, if 
zo € A is known, then zo € B follows by the Inference Rule for IF. On the other hand, if 
xo ¢ B is known, then xo ¢ A follows by taking the contrapositive and using the Inference 
Rule for IF. 

The next few proofs illustrate both the subset proof template and the subset knowledge 
template. We are continuing to prove parts of the Theorem on Subsets, stated in the previous 
section. Throughout the discussion, we fix arbitrary sets A, B, and C. Comments about 
how we are generating the proofs appear in square brackets. 


3.13. Proof of Transitivity. We must prove: if A C B and B C C, then A C C. [Try 
a direct proof of the IF-statement.] Assume A C B and B C C. Prove A C C. [We are 
now proving a subset statement, so use the subset proof template.] Fix an arbitrary object 
xo. Assume zo € A. Prove xo € C. [To use the assumptions, invoke the subset knowledge 
template twice.] Since A C B and zo € A, we deduce xo € B. Since B C C and xo € B, we 
deduce zo € C. The proof is now done. 


We can give some intuition for the result just proved using Venn diagrams. Here is a 
visual version of the transitivity of C: 


IF AND THEN 
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Although these Venn diagrams provide valuable visual insight into why transitivity should 
be true, the diagrams by themselves are not a proof of this result. The preceding picture 
only shows that transitivity holds for the three specific sets shown (namely, the sets of 
points within the circles labeled A, B, and C). In other words, the Venn diagrams prove 
the existential statement: “There exist three sets A, B, C, such that if AC B and BCC, 
then A C C.” But the diagrams do not prove the stronger universal statement: “for all 
sets A, B, C, if A C B and B C C, then A C C” asserted in the theorem. Although the 
visual intuition conveyed by the diagrams is certainly compelling, we have no guarantee 
that this intuition extends to all abstract sets, which might be sets of numbers, or sets 
of functions, or sets of cows, or sets of other sets, not merely sets of points that we can 
draw. To get a rigorous proof of the universal version of the statement, we must rely on the 
formal definitions and the rules of logic, rather than invoking pictures. Similar comments 
apply to many other theorems about sets: Venn diagrams can prove existential statements, 
and they can supply intuition for other kinds of statements, but they cannot prove universal 
statements about all sets. 


3.14. Proof of Inclusion Reversal. We must prove: if A C B, then C — B C C — A. [Try 
a direct proof of the IF-statement.] Assume A C B. Prove C— B C C — A. [We are proving 
a subset statement, so use that template.] Fix an arbitrary object £o. Assume xo € C — B. 
Prove zo € C — A. We have assumed zo € C and zo ¢ B. We must prove xp € C and 
xo ¢ A. First, prove zo € C; note this statement has already been assumed. Second, prove 
xo ¢ A. [Use the subset knowledge template, part (b).] Since we have assumed A C B and 
zo ¢ B, we can deduce that rg ¢ A, as needed. 


The proof reveals why the inclusion of A within B gets reversed to become an inclusion 
of C— B within C— A. The following Venn diagrams provide additional intuition for why this 
reversal happens. The circle for A is nested within the circle for B, since we are assuming 
A C B. The shaded part of the left diagram shows C — B, whereas the shaded part of 
the right diagram shows C — A. We see that the first shaded shape is contained within the 


second shaded shape. 


If you still think that these diagrams are enough to prove the result, note that the 
diagram makes hidden assumptions about how the set C intersects the other two sets. 
What if the circle for C is completely outside the circle for A, or completely inside the circle 
for B, or in some other position? What if A happens to be the empty set? We would need 
new Venn diagrams to cover these and other situations that might arise. On the other hand, 
the formal proof given earlier works for generic (arbitrary) sets A, B, and C. 

Our next proof introduces another key ingredient in proof-writing: using previously 
proved results to shorten the proofs of new results. 


3.15. Proof of Least Upper Bound. We must prove AUBCC iff ACC and BCC. 
[This is an IFF-statement, so we need a two-part proof.| 

Part 1. Assume AU B C C. Prove AC C and B C C. [We must prove an AND-statement, 
so we need a two-part proof within part 1.] 


B 


B 
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Part 1a. Prove A C C. Fix ap. Assume xp € A. Prove 7 € C. We already proved 
(Theorem on Subsets, part (e)) that A C AUB. Since xo € A, it follows that rp € AUB. 
Since we assumed AU B C C, it follows that zo € C, as needed. 

Part 1b. Prove B C C. [We could prove this as in part 1a, but here is a faster proof using 
known theorems.] We have already proved that B C AU B, and we have assumed that 
AUB C C in part 1. By transitivity (Theorem on Subsets, part (b)), B C C follows. 
[Part la could have been proved in this way, as well. Observe that our proof of Part la 
essentially repeats the proof of transitivity.] 


Part 2. Assume A C C and B C C. Prove AUB C C. [Our goal is a subset statement, so 
we use that template.] Fix z9. Assume zo € AU B. Prove zọ € C. We have assumed zo € A 
or zo € B. [Having assumed an OR-statement, we are led to a proof by cases.] 

Case 1. Assume zo € A. Prove zp € C. Since we have assumed A C C and z € A, zo E C 
follows. 

Case 2. Assume zo € B. Prove zo € C. Since we have assumed B C C and z € B, zo E€ C 
follows. 


Set Equality 


When are two sets equal? Informally, a set is a collection of objects called the members of 
the set. With this intuition, it seems reasonable to say that two sets should be equal iff they 
have precisely the same members. This leads to the following definition, which is sometimes 
called the Axiom of Extension. 


3.16. Definition: Set Equality. For all sets A and B: 
A=B)\iff|Va,rEeASre Bl. 


We can use logical transformations to relate this definition to the definition of subsets. 
First, Vx,x € A@ x € B is equivalent to 


Va, ((x € ASareEB)A(teBsxeA)) 


by the Theorem on IFF. Second, by the quantifier property linking V and ^, the previous 
formula is equivalent to 


VyyEASyEeEB) A (Vz,z€B>2z€A). 


We recognize the definition of subset in two places here, so the last formula is equivalent 
to AC Band B C A. Chaining together these equivalences, we conclude: 


For all sets A and B,| A= B|iff| AC Band BCA. 


This leads to the following proof template for proving that two sets are equal. 


3.17. Proof Template to Prove a Set Equality A = B. (A and B are sets.) 
Part 1. Prove A C B. 
Part 2. Prove BC A. 


Typically, we would use the subset proof template to accomplish parts 1 and 2, though 
other methods can sometimes be used (e.g., invoking previously proved theorems). The next 
theorem lists many identities asserting the equality of various sets. 

3.18. Theorem on Set Equality. For all sets A, B, C, A’, B’: 
(a) Reflexivity: A = A. 
(b) Symmetry: If A = B, then B = A. 
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(c) Transitivity: If A= B and B=C, then A=C. 


(d) Substitution Properties: If A = A’ and B = B’, then: ANB = A'A B’ and AUB = A'U B’ 
and A— B = A' — B' and (AC B iff A’ C B^). 


e) Commutativity: AN B = BN A and AU B= BUA. 

f) Associativity: (AN B) NAC = AN (BNC) and (AU B)UC = AU (BUC). 

g) Distributive Laws: AN (BUC) = (AN B)U (ANC) and AU(BNC) = (AU B)A (AUC). 
h) Idempotent Laws: AN A = A and AU A = A. 

i) Absorption Laws: AU (AN B) = A and AN (AU B) = A. 

j) Detecting the Empty Set: (B = 0 iff Vz, z ¢ B) and (B F O iff Jz, z € B). 


(k) Properties of the Empty Set: AU Q = A and ANQ = and A— = Aand- A=0 
and A—A=9. 

(1) de Morgan Laws for Sets: A — (B U C) = (A — B) N (A — C) and A- (BNC) = 
(A— B)U (A-C). 

(m) Set Partition of One Set by Another: A = (A— B)U (AN B) and (A— B)A (ANA B) = 9. 
(n) Set Partition of One Set by a Subset: If B C A, then A—(A-— B) = B, A = BU(A-B), 
and BN (A-— B) =b. 

(o) Characterizations of Subsets: The following conditions are equivalent: 

ACB, ANB=A;AUB=B;A-B=9%. 


( 
( 
( 
( 
( 
( 


The first four properties listed here may appear to be obvious facts about equality that 
do not require proof. In certain theories, where equality is taken as an undefined term, these 
facts might be adopted as axioms about the equality symbol. But in set theory, set equality 
is a defined term, so we can prove these facts from more basic principles. As a sample, we 
prove parts (a) and (c). 


3.19. Proof of Reflexivity of Set Equality. Let A be a fixed but arbitrary set. We will 
prove A = A. [Follow the new template, replacing B by A.] Part 1. Prove A C A. This is a 
known theorem (Theorem on Subsets, part (a)). Part 2. Prove A C A. This is also a known 
theorem. So A = A holds. 


3.20. Proof of Transitivity of Set Equality. Let A, B, and C be fixed but arbitrary 
sets. We prove: if A = B and B = C, then A = C. Assume A = B and B = C; we must 
prove A = C. [Expand the assumptions using the second version of the definition of set 
equality.] We have assumed that A C B and B C A and B C C and C C B. [Now attack 
the goal using the set equality template.] 

Part 1. Prove A C C. Since A C B and B C C were assumed, A C C follows by the 
previously proved transitivity of C (Theorem on Subsets, part (b)). 

Part 2. Prove C C A. Since C C B and B C A were assumed, C C A follows by the known 
transitivity of C. 


3.21. Remark (Optional). Our definition of set equality states that for all sets A and B, 
A = B iff Vx,x € A & x € B. The forward direction of this IFF-statement can be viewed 
as a substitution property that allows us to replace a set A appearing on the right side of 
the membership symbol € by an equal set B. We need one more axiom stating an analogous 
substitution property for sets appearing on the left side of the membership symbol. Here 
is the axiom: for all sets A,B,C, if A = B, then [(A € C) © (B e C)]. Various other 
substitution properties for equal sets (such as those in part (d) of the Theorem on Set 
Equality) need not be stated as axioms, since they can be proved from the definitions. 
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Set Equality Proofs 


We continue to illustrate the proof template for set equality by proving more items from 
the Theorem on Set Equality. Remember that it is no longer necessary to prove everything 
from the definitions alone; you can and should use previously known results such as the 
Theorem on Subsets. In the following, fix arbitrary sets A, B, C, A’, and B’. 


3.22. Proof of Idempotent Laws. 

Part 1. We prove AM A = A. Part 1a. We prove AN A C A. This follows from the known 
result AN B C A (Theorem on Subsets, part (c)) by replacing B by A. Part 1b. We prove 
AC ANA. Fix an arbitrary object xo. Assume xp € A. Prove xo E€ AN A. We must prove 
“tg € A and zo € A.” To do this, [following the AND-template] first prove xo € A and then 
prove xo € A. In each part, we already assumed the statement to be proved. 

Part 2. We prove AU A = A. Part 2a. We prove AU A C A. Fix an object yo. Assume 
yo E AU A. Prove yo E€ A. We have assumed the truth of “yo E€ A or yo € A.” By the truth 
table for OR, the truth of P V P forces P to be true. So yo € A must be true, as needed. 
[If we follow the templates mechanically, we could also have finished using a proof by cases. 
Case 1 assumes yo E A and proves yo € A; case 2 assumes yo € A and proves yo € A. It 
seemed easier here to invoke the truth table instead.] Part 2b. We prove A C AU A. This 
follows from the known result A C AU B (Theorem on Subsets, part (e)) by replacing B 
by A. 


3.23. Proof that AN@ = Ø. Part 1. We prove AN Ø C Ø. This follows from the lower 
bound property (Theorem on Subsets, part (c)). Part 2. We prove Ø C AN Ø. This follows 
from the minimality of Ø (Theorem on Subsets, part (g)). 


The previous proof showed that a certain set was empty by using the standard set 
equality proof template. However, there is another way to prove that a set is empty that is 
much more frequently used. It is based on part (j) of the Theorem on Set Equality, which 
we now prove. 


3.24. Detecting the Empty Set. We prove: | B Æ Ø | iff} dz, z € B |. The other assertion 
in part (j) of the theorem follows from this one by denying each side of the IFF-statement. 
[The following proof may be considered optional.] 

Part 1. Assume B Æ @. Prove 3z,z € B. Negating the original definition of set equality, we 
have assumed that Jx,x € B@ x € Ø. Let zo be a particular object that makes “zo € B 
XOR zo € Ø” true. By definition of the empty set, xo € 0) is false. So, by definition of XOR, 
xo € B is true. Now choose z to be xo to see that Jz, z € B is true. 

Part 2. Assume Jz,z € B. Prove B Æ Ø. We have assumed there is a fixed object zọ with 
zo € B. We must prove dz,2 € B® x €Q. Choose x = zg. As in part 1, z € B is true and 
zo € Ú is false, making the required XOR-statement true. 


The equivalence of the two boxed statements in 3.24 is used constantly in proofs involving 
the empty set, so it should be memorized carefully. (The equivalence should be intuitively 
evident as well: a set B is not empty iff there exists at least one thing that is a member of 
B.) We can use this equivalence to prove that a set is empty using proof by contradiction, 
as illustrated next. 


3.25. Second Proof that AN Ø = Ø. Assume, to get a contradiction, that AN Ø 4 Ø. Our 
result above shows that there is an object zo € AN Ø. By definition of N, we deduce zọ € A 
and zo € Ø. But also zo Z @ by definition of Ø. So we have the contradiction “zo € Ø and 
zo Z Ø.” This contradiction shows that AN@ = Ø is true. 


We introduce the following terminology for sets that do not overlap. 


Sets 115 


3.26. Definition: Disjoint Sets. We say sets S and T are| disjoint |iff] SAT = f |. We say 
sets S1, S2, ..., Sn are pairwise disjoint iff for all i,j € {1,2,...,n} with i Æ j, SiN S; =0. 


For example, part (m) of the Theorem on Set Equality says that A — B and AN B are 
disjoint sets whose union is A. 


Section Summary 


1. Proving A C B vs. Knowing A C B. To prove A C B: fix x9; assume zo € A; 
prove zo € B. When you already know A C B, knowing xo € A allows you to 
deduce zo € B; and knowing yo ¢ B allows you to deduce yo ¢ A. 


2. Set Equality. For all sets A, B,| A= B | if | Yx, rE ASre BI 


Equivalently, | A = B|iff} AC Band BC A\. 
To prove two sets A and B are equal: 1. Prove A C B. 2. Prove BC A. 


3. Theorem on Set Equality. Memorize the facts listed in the Theorem on Set 
Equality for later use. In particular, set equality is reflexive, symmetric, and 
transitive; equal sets may be substituted for one another in expressions; set inter- 
section and set union obey the commutative, associative, distributive, idempotent, 
absorption, and de Morgan laws; any set A is the union of the disjoint sets A — B 
and AN B; the four statements AC B, AN B = A, AU B = B, A — B = Í are 
all equivalent. 


4. Proofs Involving the Empty Set. For any set B, prove |B=9| by proving 


Yz,x ¢ B |. Prove| B Æ f | by proving | dz, x € B |. In general, statements involv- 


ing the empty set may be easier to prove using negative logic (e.g., proof by 
contrapositive or proof by contradiction). 


Exercises 


1. Prove part (d) of the Theorem on Subsets: for all sets A, B, C, C C AN B iff 
(CC AandCC B). 


2. Prove part (b) of the Theorem on Set Equality: for all sets A, B, if A = B, then 
B=A. 


3. Prove part (e) of the Theorem on Set Equality: for all sets A, B, ANB= BNA 
and AU B = BUA. 


4. Prove: for all sets A, A — = A. 

5. (a) Prove: for all sets A, B, C, if C C B, then A -— B and C are disjoint. (b) Use 
(a) to prove: for all sets A and B, A — B and B — A are disjoint. 

6. Prove: for all sets A and B, A— B = A- (ANB). 
Give intuition for parts (1), (m), and (n) in the Theorem on Set Equality by 
drawing appropriate Venn diagrams. Do these diagrams prove these set identities? 

8. Prove part (m) of the Theorem on Set Equality: for all sets A and B, 
A=(A-— B)U(AN B)and (A— B)N (ANB) =9. 

9. Prove part (n) of the Theorem on Set Equality: for all sets A, B, if B C A, then 
A-—(A-B)=Band A= BU(A-—B) and BN (A -— B) = 9. (Use other parts 
of the theorem, such as parts (m) and (o), to obtain a shorter proof.) 
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Prove: for all sets R and S, the sets R — S and S— R and R&S are pairwise 
disjoint, and the union of these sets is RU S. Illustrate this result with a Venn 
diagram. 

Prove: for all sets A, B, C, D, if A D B D C D D, then A- B, B—C,C—D, D 
are pairwise disjoint, and the union of these sets is A. Illustrate this result with 
a Venn diagram. 


Use a Venn diagram to express AU BUC as the union of: (a) two disjoint sets; 
(b) three disjoint sets; (c) seven disjoint sets. 


Use the Theorem on Set Equality (not definitions or proof templates) to prove 
the following identities hold for all sets. Justify each step by citing a part of the 
theorem. 

(a) (RUS)U(ROS)= RUS. 

(b) RU((SNT)NU) = ((RUS)N(RUT))A(RUYV),. 


Use Venn diagrams to disprove each statement. Discuss why Venn diagrams are 
allowed here, despite earlier claims that set identities could not be proved via 
Venn diagrams. 

(a) For all sets A and C, C = Au (C — A). 

(b) For all sets A, B, C, A — (B — C) =(A-B)-C. 

(c) For all sets A, B, C, AU (BAC) =(ANB)U(ANC). 


Disprove each statement in the preceding problem by constructing specific exam- 
ples where every set is a subset of {1, 2, 3, 4}. 


Find and prove necessary and sufficient conditions on the sets A and C for C = 
AU (C — A) to be true. 


Find and prove necessary and sufficient conditions on the sets A, B, and C for 
A-— (B-C) = (A — B) — C to be true. 


(a) Prove: for all sets A, B, C, (A— B)U (B-C) 2 A-C. 

(b) Disprove: for all sets A, B, C, (A — B)U (B- C) = A-C. 

(c) Prove: for all sets A, B, C, if C C B C A, then (A — B)U (B-C) = A-C. 
(d) Prove or disprove the converse of the statement in (c). 

(a) Prove: for all sets A, B, and C, if A — C = B — C, then AUC = BUC. 
(b) Prove or disprove the converse of part (a). 


Prove or disprove: for all sets A, B, and C, if A— C = B—C and ANC = BNC, 
then A= B. 
Prove or disprove: for all sets A, B, and C, if A— C = B-C and C — A = C — B, 
then A = B. 


Find and prove necessary and sufficient conditions on the sets A, B, and C for 
AU(BNC) =(AUB)NC to be true. 
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3.3 Set Equality Proofs, Circle Proofs, and Chain Proofs 


Set equality proofs occur everywhere in mathematics, so we keep studying them in this 
section. A set equality is a particular kind of IFF-statement. In the second part of the 
section, we develop two new ways for proving IFF-statements — the circle proof method 
and the chain proof method — that can help prove set equalities and other biconditional 
statements. 


Set Equality Proofs, Continued 


The most common method for proving A = B, where A and B are sets, is to use the set 
equality proof template: 

Part 1. Prove A C B. 

Part 2. Prove BC A. 

In this section, we continue practicing this proof method, but we also introduce several 
other ways of proving that two sets are equal. We first recall one such alternate method 
that is often used to prove that a given set equals the empty set. 


3.27. To Prove B = Í (where B is a set): 

Assume, to get a contradiction, that B Æ 0. 

We have assumed that there is an object zo with zo € B. 
Expand the definition of “zo € B” to derive a contradiction. 
Conclude that B = @. 


3.28. Example. Prove that for all sets A, A — A= Í. 
Proof. Assume, to get a contradiction, that A—A 4 Ø. Then there exists an object zo E€ A—A. 
By definition of set difference, zọ E€ A and zp ¢ A. This is a contradiction, so A — A = Í. 


3.29. Practice with Set Proofs. Try to prove the following three statements yourself, 
before reading the answers given below. Let A, B, and C be fixed, but arbitrary sets. Here, 
you may use any parts of the Theorem on Subsets except the result being proved. 


(a) Prove: If A C B, then AUC C BUC. 
(b) Prove: Ø — A = and A= AUD. 
(c) Prove: A — (BUC) = (A- B) A (A -— ©). 


3.30. Answer to Practice Proof (a). [Use the IF-template to prove an IF-statement.] 
Assume A C B. Prove AUC C BUC. [Now use the subset template to prove the new goal.] 
Fix an arbitrary object zo. Assume xp E€ AUC. Prove xo € BUC. We have assumed xp € A 
or Zo E€ C, and we must prove zo € B or ao € C. [Since an OR-statement was assumed, use 
cases. ] 

Case 1. Assume zo € A; prove zo € B or x € C. Since A C B is known, we deduce that 
zo E€ B, so “xo € B or x € C” is true by definition of OR. 

Case 2. Assume zo € C; prove zo € B or xg € C. This follows by definition of OR. 


3.31. Answer to Practice Proof (b). Part 1. Prove Ø — A = Ø. Assume, to get a 
contradiction, that Ø — A Æ Ø. Then there exists zo € Ø — A. By definition of set difference, 
zo € Ú and zo ¢ A. But, we also know that zọ ¢ Ø. The contradiction “zo € Ó and zo ¢ Ø” 
proves that 0 — A= Í. 

Part 2. Prove A= AU9Q. Part 2a. Prove A C AU Q. This is a known theorem (take B = 0) 
in part (e) of the Theorem on Subsets). Part 2b. Prove AU Ø C A. Since Ø appears here, 
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we try a contradiction proof. Assume, to get a contradiction, that A U Ø Z A. Then there 
exists zq with zọ E€ AU@ and zọ ¢ A. We know “zg € A or 2 € 0” is true. However, z9 € A 
is false by assumption, and zg € 9) is false by definition of the empty set. So, “zo € A or 
zo € Ø” is false. This contradiction proves part 2b. 


3.32. Answer to Practice Proof (c). Part 1. Prove A— (BUC) C (A—B)n(A-C). Fix 
Xo; assume zo E€ A— (BUC); prove xo E€ (A— B) N (A — C). We have assumed xo € A and 
zo € BUC; we must prove zo E€ A— B and zo E€ A — C. Continuing to expand definitions, 
we have assumed zo € A and (zo ¢ B AND ao ¢ C); we must prove xp E€ A and 29 ¢ B and 
xo E A and xo ¢ C. All four parts of the new goal have already been assumed, so part 1 is 
done. 

Part 2. Prove (A— B)N(A—C) C A—(BUC). Fix yo; assume yo E€ (A— B) N (A-— C); prove 
yo E€ A—(BUC). Expanding definitions as in part 1, we eventually see that we have assumed 
(yo E AA yo Æ B)A (yo E AA yo Z C), and we must prove (yo E€ AA (yo € BA yo € C)). 
All three parts of this AND-statement have already been assumed, so part 2 is done. 


Circle Proof Method 


Consider part (o) of the Theorem on Set Equality. This item asserts that four propositions 
(call them P, Q, R, and S) are all equivalent. More explicitly, part (o) says that 


(PBQ)AQSRA(RSS)A(SSP)A(PSRA(QSS). 


Using the AND-template and IFF-template to prove this, we would need to perform twelve 
independent subproofs: 


la. Prove P > Q. 1b. Prove Q > P. 
2a. Prove Q > R. 2b. Prove R> Q. 
3a. Prove R > S. 3b. Prove S => R. 
4a. Prove S => P. 4b. Prove P > S. 
5a. Prove P > R. 5b. Prove R > P. 
6a. Prove Q > S. 6b. Prove S > Q. 


That is a lot of work! It turns out, however, that there is a shorter way. Suppose we give 
proofs for Steps la, 2a, 3a, and 4a, so we know P => Q and Q => R and R > S and S > P. 
You can check that 
((P = Q) ^ (Q = R)) > (P => R) 

is a tautology (called “Transitivity of =”). So, once we complete Steps la and 2a, the 
Inference Rule for IF tells us that P = R is true, completing Step 5a. Using this fact 
and Step 3a, the Inference Rule for IF tells us that P = S is true, completing Step 4b. 
Repeatedly using Transitivity of IF and the Inference Rule for IF, we see that Steps 2a and 
3a and 4a yield Q = S and Q = P, which takes care of Step 6a and Step 1b. Continuing in 
this way, we see that after proving Steps la, 2a, 3a, and 4a, the eight remaining statements 
all follow automatically. Thus it suffices to prove the four statements in la, 2a, 3a, and 4a. 
This is called the circle method for proving the equivalence of four statements, because of 
the following picture: 


P > Q Ps Q 
t %4 implies f t. 
S = R S = R 


Following the single arrows (known IF-statements) around the left circle and using Tran- 
sitivity of IF, we deduce the bidirectional arrows (IFF-statements to be proved) appearing 
in the right circle, together with the diagonals P © R and Q © S. The circle method 
generalizes to any number of statements, as follows. 
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3.33. Circle Method to Prove P,, P2,..., Pa are Logically Equivalent. 
Part 1. Prove P, = P, (by any method). 
Part 2. Prove P > P; (by any method). 


Part i. Prove P; => P;+ı (by any method). 


Part n. Prove P, => Pı (by any method). 
Let us illustrate the circle method by proving part (o) of the Set Equality Theorem. 


3.34. Proof of Subset Characterization. Let A and B be fixed, but arbitrary sets. 
Part 1. Prove: AC B> ANB=A. 

Part 2. Prove: ANB=A=>AUB=B. 

Part 3. Prove: AUB=B>A-—B=9. 

Part 4. Prove: A- B=@=> ACB. 

[Stop reading here, and try to fill in the proofs of each part yourself. Hints: Use the Theorem 
on Subsets when you can; try proof by contradiction in part 3; try a contrapositive proof 
in part 4. Remember that sets already known to be equal have the same members.] 


Proof of Part 1. Assume A C B. Prove AN B= A. 

Part 1a. Prove AN B C A. This is already known (part (c) of the Theorem on Subsets). 
Part 1b. Prove A C AN B. Fix an arbitrary object rp. Assume zo € A. Prove zo € ANB. 
We must prove 2p € A and 2 € B. First, prove zo € A; note we have already assumed this. 
Second, prove x9 € B. To do so, recall we have assumed A C B and xo € A, from which 
xo € B follows [by the knowledge template for subsets]. 


Proof of Part 2. Assume AN B = A. Prove AUB = B. 

Part 2a. Prove AU B C B. Fix an arbitrary object yo. Assume yo E€ AU B. Prove yo E€ B. 
We know yo € A or yo € B, so use cases within part 2a. 

Case 1. Assume yo € A; prove yo € B. Since yo € A and AN B = A is known, we deduce 
that yo E€ AN B. Since AN B C B is known, we deduce that yo € B as needed. 

Case 2. Assume yo € B; prove yo € B. This is immediate. 

Part 2b. Prove B C AU B. This is known from part (e) of the Theorem on Subsets. 


Proof of Part 3. Assume, to get a contradiction, that AU B = B and A— B # Ø. The 
second assumption tells us that there is an object zo with z9 € A — B. This means zọ € A 
and zo ¢ B. On one hand, since zọ E€ A and A C AUB, we see that zọ E€ AU B. We have 
also assumed AU B = B, so z € B follows. We now have the contradiction “zo € B and 
zo € B,” which completes the proof of part 3. 


Proof of Part 4. Using the contrapositive method, assume A Z B; prove A— B Æ §. We have 
assumed there exists an object zo such that zo E€ AA zo ¢ B. We must prove Jw, w € A—B, 
which means Jw, w € A Aw ¢ B. Choosing w = zo completes the proof. 


Having proved part (o) of the Theorem on Set Equality, we can quickly deduce the 
absorption laws (part (i)) as follows. 


3.35. Proof of Absorption Laws. Let X and Y be arbitrary sets; we prove X U (X A 
Y) = X and XN(X UY) = X. Since XAY C X is already known, part (o) of the 
theorem (with A there replaced by X N Y, and with B there replaced by X) tells us that 
(XNY)UX = X. Assuming commutativity of union has already been proved, we conclude 
that X U(X NY) = X. Similarly, since X C X UY is already known, part (o) gives 
XM(X UY) =X, which is the second absorption law. 
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Chain Proof Method 


Suppose we are trying to prove an algebraic identity of the form a = c, where a and c are 
expressions denoting real numbers. A common way to proceed is to write down a sequence 
of known identities that looks like this: 


We know a = bı A by = b2 A b2 = b3 A b3 = b4 A b4 = ©. 


” 


By repeatedly using the theorem “Vz,y,z € R, if x = y and y = z, then x = z,” we 
can conclude that a = c. This theorem is almost never mentioned explicitly; instead, we 
abbreviate the above discussion by presenting a single chain of known equalities 


a by bo bs b4 C 


that begins with a (the left side of the equality to be proved) and ends with c (the right 
side of the equality to be proved). We state this method as a new proof template. 


3.36. Chain Proof Template to Prove a = c. To prove a = c, we can present a chain 
of known equalities 
a bi by b3 ee bn—1 bn Cc 


starting at a and ending at c. The letters a, b1,..., bn, c represent any expressions denoting 
objects of the same type (e.g., sets, numbers, functions, etc.). 


3.37. Example. We use a chain proof to prove Vz,y € R, (x — y)(x + y) = x? — y’. Fix 
arbitrary x,y € R. We compute 


(x—-y)(a+y) = 2(aty)—y(ety) = ca+ay—ye—yy = 27 +2y—-ay—y? = 2? +0-y? = 2? —y?. 


Each equality in the chain is true because of a law of algebra; for instance, the first two 
equalities hold by the distributive law, and the third holds by commutativity of multiplica- 
tion and the definitions z? = xx and y? = yy. We conclude that (x — y)(x + y) = 27 — y?, 
as needed. 


Now let us return to the problem of proving IFF-statements. You can check that 
(Pe Q)A\(Q@ R)) = (P & R) 


is a tautology (called “Transitivity of =”). By analogy with what we did above for equality 
proofs, we can formulate the following chain method for proving IFF-statements. 


3.38. Chain Proof Template to Prove P © R. To prove the IFF-statement P © R, 
we can present a chain of known IFF-statements 


PEQ Q295036°:°°-SQn-18S QR 


starting at P and ending at R. 


Note that the displayed formula is really an abbreviation for 


(P © Q1) A (Q€ Q2) A+: A (Qn € R). 


For instance, P & Q © R (with no parentheses) is being used to abbreviate (P = Q)A(Q = 
R). This abbreviating convention is dangerous, since (for example) the propositional form 
(P & Q) © R is not logically equivalent to (P = Q) A (Q © R). However, expressions 
such as (P & Q) © R seldom arise in practice, and context frequently tells us when this 
abbreviation is being used. To illustrate the chain method, we prove part (f) of the Theorem 
on Set Equality. 
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3.39. Proof of Associativity of N and U. Let A, B, C be fixed sets. We must prove 
(AN B)NC=AN(BNC). By the original definition of set equality, we must prove: 


Va,x €(ANB)NC sxe AN(BAC). 


Let xo be a fixed object. To prove the IFF-statement for x9, we present the following chain 
of known equivalences: 


zo E (AN B)NAC S (ap EANB)AXMEC 
< (xo E AAT E B)AToEC 
S zo E AA (zo E BAX E C) 
S zo E AA (zo E BNC) 
< zro E AN(BNC). 


The third equivalence is true by the known associativity of AND (see the Theorem on 
Logical Equivalences). All the other equivalences are instances of the definition of N. We 
can prove associativity of U in the same way, by replacing N by U and A by V everywhere 
in the proof above. 


We can give similar chain proofs for several other parts of the Set Equality Theorem 
(including commutativity, the distributive laws, and the absorption laws), by reducing the 
identities for sets to corresponding known equivalences from the Theorem on Logical Equiv- 
alences. This proof technique vividly reveals how properties of the logical operators lead 
directly to analogous properties of set operations. 


Transitivity and Chain Proofs (Optional) 


At this point, we have encountered several mathematical relations that have a property 
called transitivity. We give an abstract definition of this property later, when we study 
relations (see 86.1). We now list some specific examples of the transitive law occurring in 
algebra, set theory, and logic. 

3.40. Transitivity Properties. 

(a) Transitivity of Equality: For all objects a,b,c, if a = b and b = c, then a = c. 

(b) Transitivity of <: For all a,b,c € R, if a < b and b < c, then a < c. 

(c) Transitivity of >: For all a,b,c € R, if a > b and b > c, then a > c. 

(d) Transitivity of Subsets: For all sets A, B,C, if AC B and B C C, then ACC. 

(e) Transitivity of IF: For all propositions P, Q, R, if P > Q and Q = R, then P > R. 

(£) 


f) Transitivity of IFF: For all propositions P, Q, R, if PS Q and Q € R, then PS R. 


Whenever we have a transitive relation, we can use the chain proof technique to prove 
that this relation holds between two given objects. We have already discussed chain proofs 
of equalities and IFF-statements. To give a chain proof of an inequality such as a > c, we 
present a chain of known inequalities 


a > bı > bz > b3 >- > bn >, 


which abbreviates “a > bı and bı > b2 and bg > bz and ... and bn > c.” Using Transitivity 
of > repeatedly, we successively deduce a > b2, then a > b3, and so on, until finally we 
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obtain the needed conclusion a > c. To prove A C C by the chain proof method, we find a 
chain of known set inclusions 


ACB, CBs CBs Cus CB, Ce. 


To prove P = R by the chain proof method, we find a chain of known IF-statements 


P > Qi > Q2 >= Q3 > > Qn => R. 


Beware: the chain proof technique is very convenient when it succeeds, but this proof 
method cannot be used in all situations. When trying to prove a complicated set inclusion 
or set equality, it is often safer to use the subset proof template or set equality template 
rather than attempting a chain proof. 


Section Summary 


1. Circle Proof Method. To prove that statements Pi, P2,...,P, are all logically 
equivalent, we can prove Pi = P and P> => P; and ... and P,-1 => Php and 
P,, => Pı. Reordering the original list of statements is allowed and may make the 
implications easier to prove. 


2. Chain Proof Method for Equalities. To prove an equality a = c, find a chain of 
known equalities a = bı = b2 =--- = bn = c starting at a and ending at c. 


3. Chain Proof Method for IFF Statements. To prove an IFF statement P = R, 
find a chain of known equivalences P © Q1 € Q2 --+- &Q, € R starting at 
P and ending at R. 


4. Chain Proof Method for Transitive Relations. Whenever we have a transitive 
relation (such as =, <, <, >, >, C, >, or =), we can prove the relation holds 
between given objects A and B by finding a finite chain of known relations leading 
from A to B through intermediate objects. Each step in the chain should be 
explained by citing known results. 


5. Advice for Set Proofs. When asked to prove a complicated statement involving 
sets, steadfastly keep applying the proof templates for subsets, set equality, set op- 
erations, and the logical operators as they arise within the statement. Remember 
the special templates involving the empty set, and be ready to try contrapositive 
proofs or proof by contradiction. Some set identities can be reduced to logical 
identities by using a chain proof. 


Exercises 


1. Outline a proof of the following statement using the circle method. For all sets 
X CR”, the following statements are equivalent: X is compact; X is sequentially 
compact; X is closed and bounded; X is complete and totally bounded. 


2. Given statements P, Q, R, S, T, suppose we have proved P > Q and Q = S 
and S > P and T => R and R= P. Have we proved that all five statements are 
logically equivalent? If not, find one additional IF-statement that will complete 
the proof; give all possible answers. 


3. Repeat the previous question assuming we have proved P © R and Q & R and 
SsRandTssS. 


4. Give a chain proof of the commutative laws for set union and intersection (The- 
orem on Set Equality, part (e)). 


Sets 


14. 


15. 


16. 


17. 


18. 


19. 


20. 
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Give a chain proof of the distributive laws for set union and intersection (Theorem 
on Set Equality, part (g)). 


Give chain proofs of the following algebraic identities. (You will need the dis- 
tributive law and similar facts.) 

(a) Vz,y ER, (x +y)? = £? + 2zy ty’. 

(b) Vz,y € R, (x — y)(a? + zy + y?) = r’ — y’. 

Let x be a fixed positive real number. Prove the following statements are equiv- 
alent using a circle proof: x > 1; 2? > 2; 1/x? < 1/2; 1/x < 1. 


Let x be a fixed negative real number. Which pairs of statements in the previous 
exercise are logically equivalent? 


Using any method, prove this part of the Theorem on Set Equality: for all sets 
A,B,C, A— (BAC) =(A- B)U(A-C). 


Give direct proofs of the implications A C B > AU B = B and AANB = A > 
AC B in part (0) of the Theorem on Set Equality. 


Prove: for all sets A, B, C, (A — B)—- C = A — (BU C) = (A —- C) — B. 
Prove part (d) of the Theorem on Set Equality (substitution properties). 


Prove: for all sets A and B, the following statements are equivalent: AN B = 0; 
A-B=A;B-A=B,;VCCA,VDCB,CND=9. 


Prove: for all x € Ryo, the following statements are equivalent: x is not rational; 
x/3 is not rational; 3/x is not rational; (3/2) + 2 is not rational; 2x + 3 is not 
rational. 


Define the symmetric difference of two sets A and B as follows: 


Vz, x E€ AAB elre AGTEB. 


Informally, AAB consists of all objects that belong to exactly one of the sets A 
and B. Prove the following identities hold for all sets A, B, and C. 

(a) AAB = BAA (commutativity). 

(b) (AAB)AC = AA(BAC) (associativity). 

(c) AAM = A = OAA (identity). 

(d) (AAB)NC = (ANC)A(BNC) (distributive law). 


Illustrate the identities in the previous exercise (as best you can) using Venn 
diagrams. 


Let A and B be arbitrary sets. 

(a) Prove: AAB = (A — B) U (B — A). 

(b) Prove: AAB = (AU B) — (AN B). 

Prove: VA, 3! B, AAB = 9 = BAA. 

Prove or disprove: For all A, B,C, x, x € (AAB)AC iff x is a member of exactly 
one of the sets A, B, C. 


(a) Is V (OR) transitive? In other words, is it always true that if AV B and BVC, 
then A V C? Prove your answer. 

(b) Decide (with proof) whether ^A (AND) is transitive. 

(c) Decide (with proof) whether @ (XOR) is transitive. 
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3.4 Small Sets and Power Sets 


In our initial informal discussion of set theory, we used the notation {1,4,5,7} to describe 
a set whose members were 1, 4, 5, and 7. In this section, we give a precise formal definition 
of this notation. This enables us to understand several key technical issues involving sets — 
whether order and repetition matter in sets, how sets may appear as members of other sets, 
and the distinction between € (the set membership relation) and C (the subset relation). 
We also discuss the power set of a set X, which is the set of all subsets of X. 


Small Sets 
We have already introduced the informal notation U = {21, 22,...,2n} to represent a finite 
set whose members are 21, 22,..-, Zn (and nothing else). We now give more formal definitions 


of what this notation means, for specific small values of n. To see why this formal definition 
is necessary, consider the following questions. 


(a) Is {1,2} = {2,1}? (Does order matter in a set?) 

(b) Is {3} = {3,3,3}? (Does repetition matter in a set?) 

(c) Is {0} = 0? (Is a set containing the empty set the same as the empty set?) 
(d) Is {1, {2,3}} = {1, 2,3}? (Do nested braces change the set’) 


To give incontrovertible answers to these questions, we need formal definitions to work with. 
As before, we define new sets by specifying precisely which objects are members of these 
sets. 


3.41. Definitions of Small Sets. 
(a) Empty Set: | For all z, z ¢ 0. | Using curly brace notation: | for all z, z ¢ { }. 


(b) Singleton Sets: For all a, z,|z € {a} | iff [z =a). 


(c) Unordered Pairs: For all a,b, z,|z € {a,b} | if |z =a or z=b}. 


(d) Unordered Triples: For all a,b,c, z, |z € {a,b,c} | iff} z=aorz=borz=c). 


We can give analogous definitions for {a,b,c,d}, {a, b,c, d,e}, etc. 


We have called {a,b} the unordered pair with members a and b, suggesting that order 
does not matter in a set. We can prove this fact now. 


3.42. Theorem. For all a,b, {a,b} = {b,a}. 
Proof. Fix arbitrary objects a,b. We must prove Vz, z € {a,b} = z € {b,a}. We use a chain 
proof. Fix an arbitrary object zo. We know 


zo E {a,b} & (z0 =a V zo = b) S (z0 = b V zo = a) & zo E {b,a}. 


The first and third equivalences come from the definition of unordered pair, and the middle 
equivalence is an instance of the tautology (P v Q) = (QV P). 


An analogous proof shows that for all objects a, b, c, 
{a,b,c} = {a, c, b} = {b, a,c} = {b, c, a} = {c,a, b} = {c, b,a}, 


and similarly for larger sets. In general, displaying the members of a set in a different order 
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does not change the set. Later, we discuss concepts (such as ordered pairs, ordered triples, 
and sequences) in which the order does make a difference. 

Next consider the issue of repetition of elements. The definition of unordered pair allows 
the possibility that a = b. In this case, we can prove that the unordered pair is the same as 
a singleton set. 


3.43. Theorem. For all a, {a,a} = {a}. 
Proof. Fix an arbitrary object a. We must prove Vz, z € {a,a} = z € {a}. We again use a 
chain proof. Fix an arbitrary object z 9. We know 


zo E {a,a} & (z0 =a V zo = a) > (20 = a) S zo € {a}. 


The equivalences follow by definition of unordered pair, by the tautology (PV P) & P, and 
by definition of singleton sets. 


Similarly, we can prove that for all a,b,c, {a} = {a,a,a,a}, {a,b,c} = {a,b,b,c,c,c}, 
and so on. In general, listing the members of a set with repetitions does not change the set. 
Combining this with the previous observation, we can summarize by saying that 


Order and repetition do not matter for sets; 
only membership in the set matters when deciding set equality. 


Next, let us turn to the nuances of the curly brace notation for small sets. Our first 
result shows that adding new braces to a set can change the set. 


3.44. Theorem. | {0} # 0| and {{O}} Æ {0}. 
Proof. We first prove {0} 4 Ø. To do so, we prove Az, z € {0}. Choose z = 0; we must prove 
Ø € {0}. By definition of singleton, we must prove Ø = Ø, which is a known fact (reflexivity 
of set equality). 

Next we prove {{0}} 4 {0}. Note that we can prove a set inequality A 4 B by finding 
a member of A that is not a member of B. In this case, A is {{@}} and B is {0}. Consider 
z = {0}. On one hand, by definition of a singleton set, z € A (i.e., {0} € {{O}}) because 
{0} = {0}. On the other hand, using the result proved in the last paragraph, z ¢ B (i.e., 
10} Z {0}) since {0} 4. 


If you find the proof just given to be cryptic, here is some intuition that may help. A set 
(collection of objects) should not be confused with the members of the set. In particular, 
a singleton set {b} containing a single object b is not the same thing as the object b itself. 
For example, the set {George Washington} is not the same object as George Washington 
(a set is not a person). We might think of a set as a box that contains its members (in no 
particular order). Thus, {1, 2,3} is a box containing the three numbers 1,2,3 and nothing 
else. On the other hand, S = {1, {2,3}} is a box containing two objects: the number 1, and a 
second box that itself contains the numbers 2 and 3. Question: is 2 € S? The answer may be 
unclear using the box analogy, so we refer to the formal definition. We know 2 € {1, {2,3}} 
iff (2 = 1 or 2 = {2,3}). Neither alternative is true, so 2 is not a member of S. However, 2 
is a member of one of the members of S. 

Similarly, T = {{1,2,3}} is a set with just one member, namely the set {1,2,3}. We 
have 1 ¢ T,2¢T,3¢T, but {1,2,3} € T. We think of T as a box containing another 
box containing the numbers 1, 2, and 3. Now, the empty set (which can be denoted by 
Ø or {}) can be viewed as an empty box — a box that contains nothing. This set is not 
the same as {0}, which is a box containing an empty box. Similarly, {{0}} = {{{}}} is a 
box containing a box containing an empty box. The set {, {0}} is a box containing two 
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things: an empty box, and a box containing an empty box. And so on. In summary, nested 
braces are important when determining set membership; only items at the outermost level 
are considered members of the outermost set. Sets may contain other sets as members. 


3.45. Example. Let U = {5.3, {7}, {8,9, {10, 11}}, Z}. U is a finite set with four members. 
One member of U is the real number 5.3. Another member of U is a set whose only member 
is 7. Another member of U is an unordered triple whose three members are 8, 9, and the 
unordered pair {10,11}. Another member of U is the infinite set Z. We could write 


U = {{7},Z, 5.3, {8, {11, 10}, 9}, Z, 53/10}, but U # {5.3, {7}, {8, 9, 10, 11}, Z}. 


Although 8 € {8,9,{10,11}} and 8 € Z, 8 g U. 


Power Sets 


Informally, for any set X, the power set P(X) is the set of all subsets of X. Formally, we 
define the power set by specifying its members, as follows. 


3.46. Definition: Power Set. For all objects S and all sets X,|S E€ P(X) | iff| SC X |}. 
So, the members of P(X) are precisely the subsets of X. 


3.47. Example. You can check that 


P({7,8,9}) = {0, {7}, {8}, {9}, {7, 8}, {7,9}, {8,9}, {7, 8, IHF. 


The right side is an eight-element set with members Ø, {7}, {8}, and so on. Changing the 
braces used here produces an incorrect statement. For instance, we know Ø C {7,8,9} by 
the Theorem on Subsets, and so Ø € P({7,8,9}). On the other hand, {0} Z {7,8,9}, since 
Ú is a member of {Ø} that is not a member of {7,8,9}. So {0} ¢ P({7,8,9}). Similarly, 
7 Z {7,8,9}, since the number 7 is not even a set’. But {7} C {7,8,9}, since every member 
of {7} (namely, the number 7) is a member of {7, 8, 9}. 


3.48. Example. What is P({0, {0}})? In general, for any unordered pair {a,b}, we can see 
~ P({a, b}) = {0, {a}, {b}, {a, b}}. 
Making the replacements a = @ and b = {0}, we conclude that 
P({O, {O}}) = {0, {0}, {105}, (0, {0} }}- 
Similary, since P({a}) = {0,{a}}, we see that P({O}) = {0, {O}}, PULO) = {0, {{O}}}, 


and so on. Finally, what is P(@)? Answer: {0}, not 0. 


We must take care not to confuse the meaning of T € S and T C S. The statement 
T € S says that the object T (which might or might not be a set) is a member of the set 
S. The statement T C S says that every member of the set T is also a member of the set 
S, i.e., Vz,z€T > z € S. By definition, T C S means the same thing as T € P(S), but in 
general, T C S does not mean the same thing as T € S. 


| Actually, in an axiomatic development of mathematics from set theory, 7 (and every other mathematical 
object) is defined to be a certain set. Without knowing the definition of 7 (and 8 and 9), we cannot decide 
whether 7 C {7,8,9}. We ignore this technical point in this section. 
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3.49. Example. Let U = {5,6,7, {5,7}, {6}, {{5}}, {{5, 7}}, {5, {6}, 7}, {4}, 0}. What is 
U N P(U)? Solution. For each fixed z, we know z € UN P(U) iff (z € U and z € P(U)) 
iff (z € U and z C U). Thus, we are looking for all objects z that are simultaneously 
members of U and subsets of U. Let us test each of the ten members of U to see which of 
them are also subsets of U. The first three displayed members of U, namely 5, 6 and 7, are 
numbers, not subsets. The next member of U is {5,7}, which is also a subset of U because 
5 € U and 7 € U. Similarly, {6} € U and {6} C U (because 6 € U). On the other hand, 
{{5}} € U but {{5}} Z U because {5} is a member of {{5}} that is not a member of U. 
Next, {{5, 7}} € U and also {{5,7}} C U, where the second fact holds because {5,7} € U. 
We see that {5, {6},7} C U since each of the three members of the set on the left is a 
member of U. But {4} Z U since 4 ¢ U. Finally, Ø C U. In summary, 


U A P(U) = {{5, 7}, {6}, {{5; TH}, £5, {6}; 7}, OF. 
3.50. Theorem on Power Sets (Optional). For all sets A and B: 
(a) Ø € P(A) and A € P(A). 
(b) AC B iff P(A) C P(B). 
(c) P(AN B) = P(A) N P(B). 
(d) P(A) U P(B) € P(AU B). 
(e) P(A— B) — {0} C P(A) — P(B). 


This theorem can be proved more quickly if we remember facts proved earlier in the 
Theorem on Subsets. We illustrate by proving parts (a), (b), and (d). Let A and B be 
fixed, but arbitrary sets. To prove part (a), we must prove Ø C A and A C A. Both parts 
are already known from the Theorem on Subsets. For part (b), we first prove that A C B 
implies P(A) C P(B). Assume A C B; prove P(A) C P(B). [Continue with the subset 
template.] Fix an arbitrary object 5; assume S € P(A); prove S € P(B). We have assumed 
that S C A; we must prove S C B. Now, from S C A and A C B, we deduce that S C B 
(by transitivity of C, part of the Theorem on Subsets). 

For the converse in part (b), we now assume that P(A) C P(B), and prove that A C B. 
Fix x; assume x € A; prove x € B. [Here we need a creative step — how can we use our 
assumption about the power sets to gain information about x? The key is to realize that 
{x} is a subset of A, hence is a member of P(A).] We claim that {x} C A. To prove this 
claim, fix y, assume y € {x}, and prove y € A. Our assumption tells us that y = x, so that 
y € A by assumption on x. It follows from the claim and the definition of power set that 
{a} € P(A). We have assumed P(A) C P(B), so we deduce that {x} € P(B). This means 
{x} C B. Since we know x € {x} (as x = x), we conclude that x € B as needed. 

We continue by proving part (d). Fix an arbitrary object S; assume S € P(A) U P(B); 
prove S € P(AU B). We have assumed S € P(A) or S € P(B), so use cases. 

Case 1. Assume S € P(A); prove S € P(AU B). Since A C AUB is known, we see from 
part (b) of the theorem that P(A) C P(AU B). Now, because S € P(A), we deduce that 
S € P(AU B). 
Case 2. Assume S € P(B); prove S € P(AU B). Since B C AU B is known, we see from 
part (b) of the theorem that P(B) C P(A U B). Now, because S € P(B), we deduce that 
S € P(AU B). 
This completes the proof of part (d). The next example shows that we cannot replace C by 
= in part (d). 


3.51. Example. Disprove: for all sets A and B, P(A) U P(B) = P(AU B). 
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Solution. We must prove there exist sets A and B with P(A) U P(B) 4 P(AU B). Choose 
A= {1} and B = {2}. We know P(A) = {0, {1}} and P(B) = {0, {2}}, so 

P(A) U P(B) = {0, {1}, {2}}. 
On the other hand, AU B = {1,2}, so 


P(AU B) = {0, {1}, {2}, {1, 2}}. 


We see that {1,2} is a member of P(AU B) that is not a member of P(A) U P(B), so these 
two sets are unequal. 


We conclude with one last example of the distinction between € and C. We have proved 
that for all sets X, X C X. Using an axiom of set theory called the Axiom of Foundation, 
it can be proved that for all sets X, X ¢ X. (For details, see §3.7.) So, although every set 
is a subset of itself, no set is a member of itself. On the other hand, for all sets X, X € {X} 
and X € P(X). Similarly, “for all sets X, Ø C X” is true, but “for all sets X, 0 € X” is 
false. 


Section Summary 


1. Memorize these definitions: 
Empty Set: |Vz,z € 0. 


Singleton Sets: For all a,z,|z € {a}|iff [z= al. 


Unordered Pairs: For all a,b, z, |z € {a, b} | iff |z =a orz =b]. 


Unordered Triples: For all a,b,c, z,|z € {a,b,c} | iff} z =a or z= bor z=c\|. 


Power Set: For all S,X,|S € P(X) | iff} SC X |. 


2. Order and repetition do not matter for sets; only membership in the set matters 
when deciding set equality. For example, {1,2} = {2,1} and {3} = {3,3} and 
{4,4,4,5,5,6} = {6,6,5,4, 4}. 

3. Sets may contain other sets as members. Nested braces are important when deter- 
mining set membership; only items at the outermost level are considered members 
of the outermost set. Note carefully that Ø and { } denote the empty set, but {0} 
is a set with one member (namely, the empty set). 


4. Take care not to confuse € (the set membership relation) with C (the subset 
relation). Note x € B means x is a member of B, whereas A C B means every 
member of the set A is a member of the set B. In particular, A € A is always 
false although A C A is always true. For the set B = {4, 5, {6,7}}, we have 4 € B 
and 5 € B and {4,5} C B and {4,5} ¢ B, whereas {6,7} € B and 6 ¢ B and 
7¢B and {6,7} Z B. 


Exercises 
1. (a) Carefully prove that for all a,b,c, {a,b,c} = {b, a,c} and {a,b,c} = {a,c, b}. 
(b) Use (a) to deduce that for all x,y,z, {x,y,z} = {x,z,y} = {y,2,z} = 
{y, 2, £} = {z,v,y} = {z,y, a}. 
2. (a) Prove: for all a, {a} = {a,a, a, a}. 
(b) Prove: for all a,b,c, {a,b,c} = {a, b, b, c, c, c}. 
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3. Is the set equality {a,{b}} = {{a}, b} true for all objects a,b, for some objects 
a,b, or for no objects a,b? Explain carefully. 


4. Let S = {0,8,7/5, V2, Z, Q, {R}, {3,8}, P(Z)}. (Assume numbers are not sets 
here.) (a) Find all z, y € S such that x € y. (b) Find all x,y € S such that x C y. 


5. True or false? Proofs are not required, but expanding definitions may help you 
reach the correct answer. 

a) {1, 2,3,4,5} Æ {2, 4,5,3,1}. 

b) {1,2,3,4} = {1, 2, 2,3,3,3, 4, 4,4,4}. 


{5} € {5}. 
{5} € {{5}}. 
) 5 € {5, {5D7. 
) {5} € {5, {{5} 
i) {{5}} € (5, {{5} HF. 
J) 5S {5, {{5} FF 
k) {5} S {5, {{5} HF 
D {{5}} S {5, {5H} 
m) {{{5}}} € {5, {{5}}}. 
n) {5} € P({5, {5H}. 
o) {{5}} € PLS, {{5}}})- 
p) {{{5}}} € PLS, {{5}}}). 
a) {15} S P({5, {5} }}). 
r) {{{5}}} E P({5, {{5}}})- 
6. Compute each of the following power sets; be sure to use braces correctly. 
a) P({5, 6}) (b) PAS, {6}}) (©) PULLS, 6F}) (d) PLS, {5}, £5, (555) 
7. (a) If S has four members, how many members does P(S) have? 
b) If S has five members, how many members does P(S) have? 


8. Someone claims that {0,0} 4 {0} because a box containing two empty boxes is 
not the same as a box containing one empty box. Is this claim correct? Explain 
carefully. 


9. Let a,b,c be distinct objects that are not sets. Define 
S = {a, {b}, c, {{b}, c}, {a,c}, {{a, ch}, {{{a, cht}, (La, ch, b}, {1b}, a, {eh}, fa, {b}, ch}. 


a) How many members does S have? How many members of S are sets? 

b) List all members of S N P(S). 

c) List all members of SN P(P(S)). 

d) How many members does P(S) N P(P(S)) have? 

10. True or false? Explain each answer. Assume the variable X ranges over sets. 

a) 3X, 0 c X. (b) 3X, 0 € X. (c) SEX, X CO. (d) SEX, X €l. 

e) I! X, X € {0}. (f) YX, X C P(X). (g) 3! X, X € P(X). 

X,3!y,y € P(X). (i) 3!y, YX, y € P(X). 

11. Prove: VX,~({X} C X). 

12. Prove part (c) of the Theorem on Power Sets: for all sets A and B, P(AN B) = 
P(A) O P(B). (This can be done very quickly using part of the Theorem on 
Subsets.) 


13. (a) Prove part (e) of the Theorem on Power Sets: 
for all sets A and B, P(A — B) — {0} C P(A) — P(B). 
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14. 
15. 
16. 


17. 
18. 


19. 


20. 


21. 


22. 


23. 
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(b) Disprove: for all sets A and B, P(A — B) — 0 C P(A) — P(B). 
(c) Prove or disprove: for all sets A and B, P(A) — P(B) C P(A- B). 
Prove from the definitions: for all sets X and all objects a, a € X iff {a} € P(X). 
Let S be an arbitrary set. Find all members of P(S)N P({S}). 
(a) Prove Va, Vy, {£} = {y} © xz =y. 
(b) Prove Yz, Yy, Yz, {x,y} = {z2} & (z =£ ^z =y). 
(c) Prove Va, Vy, Vz, Vw, {x,y} = {z,w} & ((x =z ^y = w) V (z =w^y=z)). 
Prove: Va, b,c, d, {{a}, {a,0}} = {{c}, {c,d}} © [(a=0) A (b= d)]. 
Give a fully detailed proof of the following fact, used in this section: for all a, b, 


P ({a, b}) = {0, {a}, {b}, {a, b}}. (You may need proof by cases. Note that for any 
set S, you know a € S or a ¢ S; and you know b E€ S or b £ S.) 


(a) Prove: 


VS,We £ S,YT,T € P(S U {x}) iff [T € P(S)@ (z € TAT — {2} € P(S))]. 


(b) Explain informally what the statement proved in part (a) means. In particular, 
for finite S, how does the size of P(S U {x}) compare to the size of P(S)? 


A set S is called €-transitive iff for all x,y, if x € y and y E€ S, then z E€ S. 

a) Prove: For all sets S, S is €-transitive iff (Yy, y € S => y C S) iff S C P(S). 
b) Is @ €-transitive? Explain. 

c) Is {0} €-transitive? Explain. 

d) Is {{0}} €-transitive? Explain. 

a) Prove: for all S, T, if S and T are €-transitive, then SMT is €-transitive. 
b) Prove or disprove: for all S, T, if S and T are €-transitive, then S U T is 
€-transitive. 

c) Prove: for all €-transitive sets S, S U {S} is €-transitive. 


Give an explicit example of an €-transitive set S with exactly five distinct mem- 
bers. 


Prove or disprove: for all €-transitive sets S, P(S) is €-transitive. 
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3.5 Ordered Pairs and Product Sets 


This section introduces the concept of an ordered pair (a,b). In contrast to the unordered 
pairs studied earlier, (a,b) is not the same object as (b,a) when a 4 b. Ordered pairs are 
used in analytic geometry and calculus to represent points in the plane. We also define a 
product operation for sets to build sets of ordered pairs such as the zy-plane R? = R x R. 


Ordered Pairs and Intervals 


Recall that the unordered pair {a,b} is a set with a and b as members. We proved that 
{a,b} is the same set as {b,a}; order does not matter in a set. In many situations, however, 
order does matter. We now introduce a new undefined term, called an ordered pair, for such 
situations. 


3.52. Undefined Term: Ordered Pair. For any objects a and b, the notation (a,b) is 
called the ordered pair with first component a and second component b. 


We always use round parentheses for ordered pairs and curly braces for unordered pairs. 
The following axiom states the main property of ordered pairs. 


3.53. Ordered Pair Axiom. For all objects a, b, c, d,| (a,b) = (c, d) | iff} (a = c and b = d) |. 


In words, two ordered pairs are equal if and only if their first coordinates agree and 
their second coordinates agree. Thus, for instance, (3,5) 4 (5,3), but (1 + 1,2 + 2) = 
(3 — 1,2 x 2). The components of an ordered pair can be any objects — integers, real 
numbers, sets, or even other ordered pairs. For instance, ({1,2},(2,1)) is an ordered pair 
whose first component is the set {1,2} and whose second component is the ordered pair 
(2,1). For now, we postulate that ordered pairs, sets, and numbers are different kinds of 
objects. So, for instance, ((1,2),3) # (1, (2,3)) because “(1,2) = 1 and 3 = (2,3)” is false. 

Remarkably, it is possible to define ordered pairs in terms of the previously introduced 
concepts of unordered pairs and singleton sets. Then ordered pairs are certain sets, and the 
Ordered Pair Axiom can be proved as a theorem. We give the details of this process in an 
optional subsection at the end of this section. 

We now introduce interval notation for certain subsets of R. Intuitively, for given a,b € R, 
the closed interval [a,b] consists of all real numbers x satisfying a < x < b. The open interval 
(a,b) consists of all real numbers x satisfying a < x < b. Half-open intervals and intervals 
extending to +00 are defined similarly. The formal definition is as follows. 


3.54. Definition: Intervals. Let a,b be fixed real numbers. 


(a) Closed Intervals: For all z, | z € [a,b] | iff}z¢ Randa<z<b|. 


(b) Open Intervals: For all z, | z € (a,b) | iff} z € Randa<z <b}. 


(c) Half-Open Intervals: For all z, | z € (a, 6] | iff} z E€ R anda < z < bl; and 


z€[a,b)/iff}zeRanda<z<bi. 


(d) Infinite Intervals: For all z, |z € (—co,a]| iff |z ER and z<a} We define (—~w,a), 


[a, co), and (a, 00) analogously. Finally, (—0oo, 00) is the set R. We remark that the symbols 
oo and —oo are not real numbers, and we do not define intervals such as [— oo, co]. 
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Note that the notation for open intervals conflicts with the notation for ordered pairs; but 
this notation conflict seldom causes problems, since context often tells us what is intended. 
Some authors avoid this conflict by writing Ja, b[ or (a,b) to denote an open interval, but 
we will not use this convention. We stress that curly braces, round parentheses, and square 
brackets are not interchangeable in mathematics: {a,b} denotes an unordered pair, (a,b) 
denotes an ordered pair or open interval, and [a,b] denotes a closed interval. The three 
pictures below illustrate the set {1.5,4}, the open interval (1.5,4), and the closed interval 
[1.5, 4], respectively. Note that open circles are used to indicate endpoints that are excluded, 
while closed (filled-in) circles indicate endpoints that are included. 


eo; | $ = 
0 1 2 3 4 

—_—_—_—_—_=<<X O —_ > 
0 1 2 3 4 

T S S, 


Product Sets 


Given two sets A and B, the product set A x B is the set of all ordered pairs (x,y) where 
x € A and y E€ B. We state this formally as follows. 


3.55. Definition: Product Set. For all sets A and B and all objects z: 


z E€ Ax B |if|3x, 3y, (x € A^y E B)Az=(a,y)}. 


It can be proved from this definition and the Ordered Pair Axiom that for all sets A 
and B and all objects x, y, 


(x,y) € Ax B/iff}~ e A and y € BI. 


This version of the definition is frequently used in proofs. 
3.56. Example. Given A = {1,2,3} and B = {3,5}, 
Ax B = {(1,3), (2,3), (3,3), (1,5), (2,5), (3, 5)}. 

On the other hand, 

Bx A= {(3,1), (3, 2), (3,3), (5,1), (5,2), (5,3)} #Ax B. 
We have (A x B)(B x A) = {(3,3)}. Note that AN B = {3}, and (AN B) x (AN B) = 
{(3,3)}. We have (A — B) x (B — A) = {1,2} x {5} = {(1,5), (2,5)}. On the other hand, 
(A x B) — (B x A) = {(1,3), (2,3), (1,5), (2,5), (3, 5)}. 
3.57. Example. Given A = {{1,2}, (3,4)} and B = {0, {3}}, 

Ax B = {({1, 2}, 0), ({1, 2}, {3}), ((3, 4), 0), (3, 4), {3})}- 
3.58. Example. Given C = {0,1}, we have C x C = {(0,0), (0,1), (1,0), (1, 1)}. We can 


Sets 133 


then compute 


(C x C) x C = {((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 0), 
((0, 0), 1), (0,1), 1), (1,0), 1), (1, 1), 1)}; 

C x (C x C) = {(0, (0, 0)), (0, 1, 0)), (1, (0, 0)), (1, (1, 0)), 

(0, (0, 1)), (0, (1, 1)), (1, (0, 1)), 1, (1, 1))F- 

These two sets are not equal; in fact, [((C x C) x C] N [C x (C x C) = 9. 


The product set R x R consists of all ordered pairs (x,y) with x € R and y € R. We 


write R? = R x R; more generally, define | A? = A x A | for any set A. We visualize R? as 
the zy-plane by drawing two perpendicular axes intersecting at a point called the origin, 
and identifying the ordered pair (x,y) with the point reached by traveling from the origin 
a directed distance of x horizontally and a directed distance of y vertically. Then, as seen 
in the next example, the product set [a,b] x [c,d] is a solid rectangle in the plane. 


3.59. Example. Draw the following sets in the ry-plane. 

(a) R = [1,3] x [-2, 2]; (b) S = (—2,2) x (1,3); (c) RUS; (d) RNS; 

(e) {1,3,4} x {10,2}; (£) ((1,3) x {2, 2}) U ({1, 3} x (2, 3). 

Solution. See Figure 3.1. The four edges of the rectangle are part of the set R in (a), but not 
part of the set S in (b). We use dashed lines and open circles to indicate excluded regions, as 
in (c) and (d). Part (e) is a finite set of nine ordered pairs, each of which is drawn as a filled 
dot. In part (f), the graph of (1,3) x {—2,2} consists of two horizontal line segments with 
both endpoints excluded; whereas the graph of {1,3} x (—2,2] consists of two vertical line 
segments with the top endpoints included. The union of these sets is shown in the figure; 
the interior of the rectangle is not part of the union. 


Properties of Product Sets 


The next theorem lists general properties of product sets, which can all be proved from the 
definitions using the proof templates. 

3.60. Theorem on Product Sets. For all sets A, B, C, D: 

(a) Monotonicity: If AC C and B C D, then Ax BCC x D. 


(b) Distributive Laws: 
Ax (BUC) = (Ax B)U(Ax C) and (AUB) x C= (Ax C)U(B x C) and 
Ax (BNC) = (Ax B)N(Ax C) and (ANB) x C= (Ax C)N (Bx C) and 
Ax (B-C)=(Ax B)-(AxC) and (A-— B) x C=(AxC)-(Bx ©). 
(c) Intersection of Products: (A x B)N(C x D) = (ANC) x (BND). 
(d) Union of Products: (A x B)U(C x D) C (AUC) x (BUD), 
but equality does not always hold. 
(e) Product of Empty Sets: Ax 0 =@ and Ø x B=9. 
(£) Criterion for Commutativity: Ax B= Bx A if (A= Bor A=9 or B =). 


We prove a few parts of this theorem to illustrate the techniques, leaving the remaining 
parts as exercises. 


3.61. Proof of Monotonicity. Let A, B,C, D be fixed, but arbitrary sets. We prove: if 
ACC and BC D, then Ax BC Cx D. [Start with the IF-template.] Assume A C C 
and B C D. Prove A x B C C x D. [Continue with the subset template.] Fix an arbitrary 


134 An Introduction to Mathematical Proofs 


(a) (b) (c) 


(d) (e) (£) 


FIGURE 3.1 
Solution to Example 3.59. 


object z; assume z € A x B; prove z € C x D. Our assumption means that z = (a,b) for 
some a € A and b € B. We must prove (a,b) € C x D, which means a € C and b € D. On 
one hand, since a € A and A C C, we deduce a € C. On the other hand, since b € B and 
B C D, we deduce b € D. 


The following figure gives some intuition for why monotonicity holds, though the figure 
cannot replace the formal proof. 


C 


3.62. Proof that ý x B =). Let B be an arbitrary set. [To show a set is empty, use proof 
by contradiction.] Assume, to get a contradiction, that Ø x B Æ Ø. This assumption means 
dz,z € Ø x B. In turn, this means that 3x, Jy, (x € OA y € B) Az = (x,y). In particular, 
x € Ú. But we know x ¢ Ú also, which is a contradiction. So Ø x B = Ø, after all. 


Before proceeding to the next proof, we describe a modification of the subset proof 
template that is often convenient when dealing with product sets. 
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3.63. Subset Proof Template to Prove Ax BCT. 
Fix an arbitrary ordered pair (x,y). 

Assume (x,y) € A x B, which means x € A and y € B. 
Prove (x,y) ET. 


The template works since the only objects that can belong to the product set A x B 
are ordered pairs. So we may as well fix an ordered pair at the outset, rather than fixing an 
object z and then saying da, Jy, (x E€ AAy E€ B)Az = (x,y). We illustrate this template in 
the next proof. 


3.64. Proof that A x (B — C) = (A x B) — (A x C). Let A, B, C be arbitrary sets. 
Part 1. Assume (2, y) is an arbitrary member of Ax (B—C'); prove (x,y) € (Ax B)—(AxC). 
We have assumed that x € A and y € B — C, so y € B and y ¢ C. We must prove that 
(x,y) € Ax B and (x,y) € A x C. The first part of what we must prove means that z € A 
and y € B, and both statements are known from our current assumptions. The second 
part of what we must prove is that “x ¢ A or y ¢ C.” Since y ¢ C was assumed, this 
OR-statement is true. 

Part 2. [We are about to prove (A x B) — (A x C) C Ax (B — C). We observe that every 
member of the left set belongs to A x B, hence must be an ordered pair. Thus we may 
begin part 2 by fixing an ordered pair rather than an arbitrary object.] Assume (x,y) is an 
arbitrary member of (A x B) — (A x C); prove (x,y) € A x (B — C). We have assumed that 
(x,y) € Ax B and (x,y) g A x C, which means (x € A and y € B) and (x € A or y ¢ C). 
We must prove x € A and y € B — C; i.e., we must prove x € A and y € B and y Ẹ C. 
The first two statements follow from the first parenthesized statement in our assumption. 
On the other hand, since x € A is true and “x ¢ A or y ¢ C” is also true, we deduce y ¢ C 
by definition of OR. 


3.65. Proof of Commutativity Criterion. Let A and B be arbitrary sets; we prove 
AxB=BxAif(A=Bo A=ģÍor B=b). 

Part 1. We prove: “if Ax B = B x A, then A = B or A = Í or B = Ọ” using a contrapositive 
proof. Assume A # B and A # and B # Q; prove A x B # B x A. Since A and B are 
nonempty, we know there exist objects aj € A and bọ € B. We also know AZ B or BEA, 
so use cases. 

Case 1. Assume A Z B, which means there exists c with c € A and c ¢ B. Consider the 
ordered pair (c, bo). On one hand, (c, bo) € A x B because c € A and bo € B. On the other 
hand, (c, bo) ¢ B x A since c ¢ B. Thus, A x B Z B x A since we have found a member of 
A x B that is not a member of B x A. 

Case 2. Assume B Z A. The proof is similar to the proof of Case 1, so we let you fill in the 
details of Case 2. 

Part 2. We prove: “if A = B or A = or B = f, then A x B = B x A.” Assume A = B or 
A = or B = f. Prove A x B = B x A. [We have assumed an OR-statement, so use cases.] 
Case 1. Assume A = B. Then Ax B= AxA=BxA. 

Case 2. Assume A = Ý. Then both A x B and B x A equal the empty set, by part (e) of 
the theorem, hence A x B = B x A. 

Case 3. Assume B = Ø. Then both A x B and B x A equal the empty set, by part (e) of 
the theorem, hence A x B = B x A. 


Formal Definition of Ordered Pairs (Optional) 


We mentioned earlier that ordered pairs could be defined in terms of the previously intro- 
duced concepts of singleton sets and unordered pairs. Here is the relevant definition. 


3.66. Definition: Ordered Pairs. For any objects a, b, define | (a,b) = {{a}, {a, b}} |. 
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3.67. Example. We have (3,5) = {{3}, {3,5}} and (5,3) = {{5}, {5, 3}} = {{5}, {3, 5}}: 
note that the set (3,5) does not equal the set (5,3). The ordered pair (4,4) is the set 
{{4}, {4, 4}}. Since {4, 4} = {4}, we can also write (4,4) = {{4}, {4}} = {{4}}. On the other 
hand, (4, {4}) = {{4}, £4, 4} 3}, whereas ({4},4) = (11435 4}, 48 = (4b {4 {4H} 
Thus, the three sets (4,4) and (4, {4}) and ({4}, 4) are all distinct. 


We can now prove the Ordered Pair Axiom as a theorem. 
3.68. Ordered Pair Theorem. For all objects a, b, c, d, (a, b) = (c, d) iff (a = cand b = d). 


Proof. Our proof uses the following three facts, stated in the exercises of §3.4. For all 
objects x,y,z,w: (1) {x} = {y} & x = y; (2) {x,y} = {2} (z= xAz = y); and 
(3) {x,y} = {z,w} & ((x = z ^y = w) V (x = w ^y = z)). Fix arbitrary objects a,b, c,d. 
Part 1. Assume (a,b) = (c,d). Prove a = c and b = d. By definition of ordered pairs, 
we have assumed {{a}, {a,b}} = {{c}, {c,d}}. Using Fact (3) with « = {a}, y = {a,b}, 
z = {c}, and w = {c,d}, we deduce that ({a} = {c} and {a,b} = {c,d}) or ({a} = {c,d} 
and {a,b} = {c}). [We have just written a known OR-statement, so use proof by cases.] 

Case 1. Assume {a} = {c} and {a,b} = {c,d}; prove a = c and b = d. Since {a} = {c}, 
Fact (1) gives a = c. Since {a,b} = {c,d}, Fact (3) gives (a = c and b = d) or (a = d and 
b = c). [This is another OR-statement, so use cases within Case 1.] 

Case la. Assume a = c and b = d; prove a = c and b = d. The two statements to be 
proved have already been assumed. 

Case 1b. Assume a = d and b = c; prove a = c and b = d. We already know a = c. 
Using the assumptions, we see b = c = a = d, so b = d. 

Case 2. Assume {a} = {c,d} and {a,b} = {c}; prove a = c and b = d. Since {a} = {c,d}, 
Fact (2) gives a = c and a = d. Since {a,b} = {c}, Fact (2) gives c = a and c = b. So, 
a = c and b = c = a = d, as needed. [We might initially think Case 2 cannot occur, since 
{a} is a one-element set and {c,d} is a two-element set. But when c = d, {c,d} = {c} is a 
one-element set after all! Compare to the example (4,4) = {{4}}.] 

Part 2. Assume a = c and b = d. Prove (a,b) = (c,d). By definition of ordered pairs, 
we must prove {{a}, {a,b}} = {{c}, {c, d}}. Since a = c, Fact (1) shows that {a} = {c}. 
Similarly, since a = c and b = d, Fact (3) gives {a,b} = {c,d}. Using Fact (3) again taking 
x = {a}, y = {a,b}, z = {c}, and w = {c,d}, we conclude that {{a}, {a, b}} = {{c}, {c, d}}, 
as needed. 


We should not get too distracted by the odd-looking definition of (a,b) or the logical 
subtleties of the preceding proof. The definition (a,b) = {{a}, {a,b}} has various bizarre 
consequences, such as (a,b) C P({a, b}). It is theoretically interesting that we can construct 
ordered objects from sets, which are intrinsically unordered. However, in applications of set 
theory to the rest of mathematics, the only important feature of this construction is the 
criterion for equality of ordered pairs stated in the Ordered Pair Theorem. That is why we 
can safely ignore the details of this definition, as we do in the main text, treating (a,b) as 
an undefined concept obeying the Ordered Pair Axiom. 


Section Summary 


1. Intervals in R. For all real numbers a,b and all z: 


Open Interval: |x € (a,b) | iff} a ER and a < x < b|. 


Closed Interval: |x € [a,b] | if|x ER anda<a<b}. 


Half-Open Interval: | x € [a,b) if |x ER anda<z<bl. 
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We define (a, b| and intervals extending to too analogously. 


2. Ordered Pairs. For all objects a,b,c,d, (a,b) = (c,d) iff (a = c and b = d). The 
round parentheses in (a,b) denote an ordered pair (or an open interval); the curly 
braces in {a,b} denote an unordered pair; the square brackets in [a,b] denote a 
closed interval. 


3. Product Sets. For all objects x,y, |(x,y) € Ax Bl|iff |x eA and y € B|. For all 


sets A, B and all objects z, |z € Ax B)iff}da,dy,ce A^y E€ BAz=(2,y) |. 


To prove that A x B C T, we may fix an ordered pair (x, y) satisfying x € A and 
y € B, and prove that (x,y) € T. 


4. Theorem on Product Sets. The product set construction has properties such as 
monotonicity, distributive laws, (A x B) N (C x D) = (AN C) x (BAD), 
(Ax B)U(C x D) C (AUC) x (BUD), and Ax = 9 = Ø x A. However, A x B 
does not equal B x A, except when A = B or one of the sets is empty. 


5. Formal Definition of Ordered Pairs (Optional). For all objects a,b, we can define 
(a,b) = {{a}, {a,b}} and use this definition to prove the Ordered Pair Axiom 
from facts about unordered pairs. 


Exercises 


1. Let A = {2,4,5}, B = {4,5,6}, and C = {1,4,6}. Compute: 
(a) AxB (b) Bx A (c) (Ax B)N(Bx A) (da) (Ax B)N(BxC) 
(e) (AN B)x (BNC) (f) (B-—C)x(C-—B) (g) (Bx A)-(Cx B) 
(h) [(B — A) x C] U[C x (A — B)]. 
2. Let A = {1,2} and B = {3,4}. Compute: 
(a) AxB (b) P(A) (c) P(B) (d) P(AxB) (e) P(A)xP(B) (f) (Ax B)xA 
(g) Ax (Bx A) (h) Are the sets found in (d) and (e) equal? Why? (i) Are the 
sets found in (f) and (g) equal? Why? 
3. Given A = {[0,3], Z, {2}} and B = {(1, {3}), {1,3}}, compute A x B and B x A. 
Let A = [-1, 2], B =[-1,3], C = [1,4], and D = [2,5]. 
a) Draw pictures of A, B, C, D, A — C, and B — D on a number line. Express 
A-C and B — D in interval notation. 
b) Draw pictures in R? showing A x B, C x D, (A x B) — (C x D), and 
A-C)x(B-D). 
5. Define intervals A = (0,5), B = [2,3), C = (—1, 2], and D = (1,7). 
a) Draw pictures of A, B, C, D, A— B, A— C, CUD, and D — B on a number 
line. Express A — B and D — B as unions of intervals. 
b) Draw pictures in R? showing (A — B) x (D — B), (D — B) x (A — B), 
A x D)- (B x B), and (D x A) — (C x C). 
6. Define A = |-3,3], B = (—2.5, 1.5), and C = {— 
the following sets: (a) AxB (b)CxC (c) (AxB 
e)RxXA (f)BxZ (g)CxR (h)ZxC. 
7. (a) Let S be the boundary of the rectangle in R? with corners (1,2), (5,2), (5,7), 
and (1,7). Describe S' as a union of product sets. (b) Give another description of 
the set S in part (a) using only interval notation, product sets, and set difference. 
8. Let T be the cube [0,2] x [0,2] x [1,3]. 
(a) What is the geometric significance of the set {0,2} x {0,2} x {1,3}? What 
about [0,2] x {2} x [1,3]? (b) Use set operations such as x and U to describe the 


2}. Draw pictures in R? of 


1,1,2}. D 
)A(ZxZ) (d) (A—C)x(B—C) 
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10. 


11. 
12. 


13. 
14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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set of points in R? lying on one of the six faces of T. (c) Similarly, describe the 
set of points in R? lying on one of the 12 edges of T. 


Assume a, b, c, d are fixed real numbers. Prove the following properties of intervals 
in R, and illustrate with sketches. 

(a) If a < b, then [a,b] — (a,b) = {a,b}. 

(b) Ifa<b<c, then [a,b] U (b, c] = fa, c]. 

(c) Ifa<b<c<d, then fa, d] — [b,c] = [a,b) U (c, d]. 

(d) If a < b, then (a, b] = (—020,b] N (a, ov). 

(a) Prove: for all sets A, B, C, D, (A x B)N(C x D) = (AN C) x (BND). 

(b) Draw pictures illustrating the identity in (a), taking the sets A, B, C, D to 
be specific closed intervals. 


Prove: for all sets A, A x 0 = @. 


Fix arbitrary sets A, B, C. 

a) Prove the distributive law (AU B) x C = (A x C)U (B x ©). 

b) Prove the distributive law A x (BNC) = (Ax B) A (A xC). 

c) Prove the distributive law (A — B) x C = (A x C) — (B x ©). 

Draw pictures in R? illustrating each identity in the preceding problem. 

a) Prove: for all sets A, B, C, D, (A x B)U (C x D) C (AUC) x (BUD). 

b) Draw a picture in R? to show that equality does not always hold in (a). 

c) Based on your picture, guess a formula expressing (A UC) x (BU D) asa 
union of other sets. Prove your formula. 

a) Let A = [0,5], B = [1,2], C = [0,6], and D = [8,4]. Draw a picture of 
A-—B)x(C—D). 

b) Repeat part (a) with A = C = [0,3], B = [2,4], and D = [-1, 1]. 

c) Repeat part (a) with A = C = {1, 2,3,4,5}, B = {1,2,4}, and D = {4,5,7}. 
d) For arbitrary sets A, B, C, D, guess a formula expressing (A — B) x (C — D) 
in terms of A x C and other sets. Prove your formula. 

Carefully sketch the set AU BUC, where: A = ([1,8] A Z) x (1,3], 

B = ((4, 5) U(7,8)) x {2}, and C = (([1, 6.5] — (3, 4)) — (5,5.5)) x {3}. 

(a) Prove: for any closed intervals J and J in R, 1M J is a closed interval or the 
empty set. (b) Prove: for any open intervals J and J in R, if IQ J Æ Ø, then IUJ 
is an open interval. (c) Prove or disprove the converse of the statement in (b). 


For any objects a,b,c, define the ordered triple (a,b,c) to be ((a,6),c). Use the 
Ordered Pair Axiom to prove: 


Va, b,c, d,e, f, (a,b,c) = (d,e, f) & [(a = d) A (b = e) ^A (c= f). 


For any objects a,b,c, we give an alternate definition of the ordered triple by 
letting (a,b,c) be the set {(1, a), (2, b), (3,c)}. Use this definition and properties 
of ordered pairs to prove: 


Va, b,c,d,e, f, (a,b,c) = (d,e, f) & [(a = d) A (b = e) ^A (c= F)]. 


Use Definition 3.66 to prove: (a) Va, b, (a,b) C P({a,b}). 

(b) YA, B, Ax BC P(P(AUB)). (c) Vz, y, z = y © !z,z € (x,y). 

Consider the following alternate definition of ordered pairs: for all objects a, b, 
define (a,b) = {{1,a}, {2,b}}. Prove the Ordered Pair Axiom holds using this 
definition of (a, b). 
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22. Suppose we try to define ordered triples by letting (a, b,c) = {{1, a}, {2, b}, {3, c}} 
for all objects a,b,c. Disprove: 


Va, b,c, d,e, f, (a,b,c) = (d,e, f) & [(a = d) A (b = e) ^A (c= f)). 


23. Suppose we try to define ordered pairs by letting (a, b) = {a, b, {a}} for all objects 
a and b. Prove or disprove the Ordered Pair Axiom using this definition of (a, b). 
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3.6 General Unions and Intersections 


Given two sets A and B, we have defined the union AU B and the intersection AN B of 
these sets. To take the union or intersection of finitely many sets, one way to proceed is to 
iterate the binary version of the operation. For instance, the union of five sets A, B,C, D, E 
could be defined to be (((AUB)UC)UD)UE. Because of the commutative and associative 
properties of union and intersection, the order of sets and placement of parentheses in this 
expression do not affect the answer. However, we cannot use this method to define the union 
or intersection of infinitely many sets. This section describes how to handle this situation, 
using the idea of an indexed collection of sets. We also introduce set-builder notation, which 
gives a concise way to define new sets. 


Unions and Intersections of Indexed Collections of Sets 


Let J be any nonempty set, called an index set. Each member i of I is called an indez. 
Suppose that for each index i € I, we have a set A;. Informally, the union of the sets Aj, 
as į ranges over I, is the set obtained by joining together all the members of all the sets 
A; into one giant set. The intersection of the sets A; is the set of all objects z (if any) that 
belong to all of the A;. Formally, we have the following definitions. 


3.69. Definition: Union and Intersection of an Indexed Family of Sets. Given a 
nonempty set J and a set A; for each i € I: 


(a) General Union: For all z, | z € U A, | iff | di € I, z € A; |. 
i€l 


(b) General Intersection: For all z, | z € N A; | iff Vi € I,z € Ail. 
icl 


For certain index sets J, variations of this notation are used. Specifically, if J = 
{1,2,...,n} for some positive integer n, we write 


icl i=1 icl i= 


If I = Zyo is the set of all positive integers, we write 


J A= LJ As [4 = NA: 
i=1 i=1 


icI ie] 
We use similar notation for I = Z,,, I = {b,b+1,...,c}, etc. We can also change the name 
of the index i to any unused letter; for instance, Ujer Ai = Uger Ak = Unes An- 


3.70. Example. Let the index set J be Ro, the set of positive real numbers. For each 
r € Ryo, let A, = {—r,0,r}. Then Uero Ar = R, whereas fN Ar = {0}. If we shrink 
the index set from Ryo to Z>o, then U An = Z. 


rER>o 
nEZ>0 


3.71. Example. Suppose J = Z„o, and for each n € J, let A, = [—n,1/n]. For instance, 
A; = [+1,1], A2 = [-2, 1/2], Az = [-3, 1/3], and so on, as shown in this picture: 
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y 


-4 -3 -2 -1 0 1 


It is visually apparent from this picture that JL; An = (—00, 1] and NZ; An = [-1, 0]. 
In particular, for any positive real number r, there is an E no with 1/no < r, so that 
r Z An, and hence r g NZ] An 


3.72. Example. Let I be the set of rational numbers larger than m. For each q € J, let 
Aq = [¢,00). On one hand, Myer A, = Í, so that the intersection of infinitely many infinite 
sets can be the empty set. On the other hand, Wes A, = (a,00), so that the union of 
infinitely many closed intervals can be an open interval. 

Let us check the claimed set equalities in this example in more detail. First, assume to 
get a contradiction that (),<7 Aq # 9. Then there exists zo such that zo € (jc; Aq. This 
means that for all q € I, zọ E€ Ag. So for every rational q > m, we have q < z% < oo. 
However, if we let q be the smallest integer larger than zp (such an integer exists and is a 
rational number larger than 7), we have zo < q, which is a contradiction. So the intersection 
is indeed empty. 

To check ee 144 = me oo), we prove set inclusions in both directions. To prove C, 
fix an arbitrary zọ € U dé 7 Aq, and prove zo € (7,00). We know there exists qo € J with 
zo € Ag- This means that qo E€ Q, qo > 7, zo € R, and qo < zo. Since m < qo < 2, we see 
that 7 < zo, proving zo € (7,00). To prove D, fix an arbitrary yo € (7,00). Then yo € R and 
T < yo. Assume we already know the theorem that between any two distinct real numbers 
there exists a rational number. So, we can find go E Q with m < qo < yo. Then yo € Ag, 
and so yo € Ue; Aq, as needed. 

We remark that if we replace I by a finite subset I’ = {q1 < q2 < +++ < qn}, then 
User Aq = (1,0) = Aq, and Neer Ag = (Gn, ©) = Aqn- 


3.73. Example. Take the index set to be J = Zyo. For fixed n € I, let An be the set of 
real numbers of the form k/n for some k € Z. Thus, An is the set of real numbers that 
can be written as fractions where the numerator is any integer and the denominator is n. 
Then UL; An = Q, whereas N; An = Z. When checking the latter equality in detail 


(see Exercise 4), it helps to first prove that Z C An for every n € I, and Z = Aj. 


Properties of General Unions and Intersections 


The next theorem states identities satisfied by the union and intersection operations for 
indexed collections of sets. Many of these properties are analogous to properties of binary 
union and binary intersection, which were given earlier in the Theorem on Subsets, the 
Theorem on Set Equality, and the Theorem on Product Sets. 


3.74. Theorem on General Unions and Intersections. For all nonempty index sets I 
and J and all sets X and A; and B;: 


(a) Monotonicity: If A; C B; for all i € I, then U Aj Cc U B; and N AC N Bi. 
i€l i€l i€l i€I 


(b) de Morgan Laws: X — (U a) = Na — A;) and X — (A A; J LI (X — Aj). 


tel wel tel tel 


(c) Distributive Laws: X N (U ai) = U N A;) and X U (A a) = NX U A;) and 


te. icl icl icl 
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X x (U a = (J(X x Bj) and X x (A a =(\(X x Bi); 


icl ic] icl ic 
similarly if X appears as the second input to N, U, or x. 


(d) Lower/Upper Bound: For all k € I, () A; C Ap and Az C U Aj. 


icI icI 
(e) Greatest Lower Bound: X C N A; iff (Vi € I, X C Aj). 
icl 
(£) Least Upper Bound: U Ai CX iff (Wie I, A; CX). 
ier 
(g) Enlarging the Index Set: If I C J, then U A; C U A; and N Ac N Aj. 
icI jEJ jeJ icI 


(h) Combining Index Sets: 


(Uae LJ4;| = U Ak and (iG (Ar) = N Ae: 


ie. JET keIUS te. jEJ keIUJ 


The proofs of these results provide excellent practice with the proof templates involving 
quantifiers. We prove some parts to illustrate, and leave the other parts as exercises. Fix 
arbitrary sets 140, J # 0, X, Ai, and B; (for i € I or i € J, as appropriate). 


3.75. Proof of Lower/Upper Bound Property. Fix an arbitrary index k € I. We first 
prove [ler Ai C Ap. Fix an arbitrary z; assume z € [l;e Ai; prove z € Ap. We have 
assumed Vi € I,z € A;. Since k € I, we can let i = k in this assumed universal statement 
to conclude (by the Inference Rule for ALL) that z € Ag, as needed. 

Next we prove Ay C Uez 4i. Fix w; assume w € Ax; prove w € Uj, Ai- We must prove 
Ji € I,w € A;. Choose i = k, which is a member of J. Our assumption gives w € Aj, as 
needed. 


3.76. Proof of First de Morgan Law. 

Part 1. We prove X — (Uer A;) C Mier(X — Ai). Fix z; assume z € X — (hee Ai); 
prove z € {),<;(X — A;). We have assumed that z € X and z ¢ U;cr Ai; we must prove 
z € Mier(X — Aj). Continuing to expand definitions, we have assumed that “di € I, z € A,” 
is false, so that “Vi € I,z ¢ A,” is true. We must prove Vi € I, z € (X — A;). Fix an index 
io € I. Prove z € X — Aio. We must prove z € X and z ¢ Ai. On one hand, z € X was 
already assumed. On the other hand, taking 7 = ig in the assumption that Vi € I,z ¢ Aj, 
we see that z ¢ Aj,, as needed. 

Part 2. We prove er, X —-A;) CX- (Uer Aj). The proof is similar to part 1; you are 
asked to supply the details in Exercise 8. 


3.77. Proof of Enlarging the Index Set. Assume I C J. First, we prove Uj;<,; Ai C 
Ujes Aj. Fix z; assume z € Ucz Ai; prove z € U;ez Aj. We have assumed Ji € I, z € Aj; 
we must prove Jj € J,z E€ Aj. Let io be a fixed index in J with z € Ai. Choose jo = io. 
We know ig € J and I C J, so jo = io € J. Also z € Aj, for this choice of jo. 

Next, we prove Mjes A; C Mieri (note the reversal of the inclusion here). Fix w; 
assume w € [jez 4j; prove w € [fier Ai. We know Vj € J,w E Aj; we must prove 
Vi € I,w € Ai. Fix io € I; prove w € Aj,. Since ig € I and I C J, we know ig € J. So we 
can take j = ig in the known universal statement to see that w € Ai, as needed. 


3.78. Proof of (Mier Ai) N (Mjes Ai) = Nkerus 4x. We prove this set equality using a 
two-part proof. 
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Part 1. [proving C] Fix z; assume z € (ep Ai) (Mjes Aj); prove z € Meru 4r- We have 
assumed z € (),-; Ai and z € jez Aj, which means Vi € I,z € A; and Vj € J,z € Aj. We 
must prove Vk € ITU J,z € Ag. Fix k € TU J; prove z € Ay. We know k € I or k € J, so 
consider two cases. 

Case 1. Assume k € I; prove z € Ax. This follows by taking i = k in the first universal 
statement in our earlier assumption. 

Case 2. Assume k € J; prove z € Ax. This follows by taking 7 = k in the second universal 
statement in our earlier assumption. 

Part 2. [proving 2] Fix w; assume w € [fkerug Ak; prove w € (Mier Ai) N (Mjes Aj). We 
have assumed Yk € TU J,w € Ap. We must prove w € [)jc7 Ai and w € jez Aj. First, 
we must prove Vi € I,w € Ai. Fix i € I; prove w € A;. Since I C ITU J, we know 
i € IUJ and hence w € A; (by taking k = i in the earlier assumption). Next, we must 
prove Vj € J,w € A;. Fix j € J; prove w € Aj. Since J C TU J, we know j € TU J and 
hence w € A; (by taking k = j in the earlier assumption). 


Set-Builder Notation 


Before concluding our study of set theory, we need to consider a notational device called 
set-builder notation that is frequently used to introduce new sets. Suppose P(x) is an open 
sentence with free variable x. Informally, the notation S = {x : P(x)} means that S is 
the set of all objects x such that P(x) is true. Similarly, if B is a given set, the notation 
S = {x € B : P(x)} means that S is the set of all objects x € B such that P(x) is true. 
Many variations of this notation are possible; for example, we could define product sets by 
writing Ax B = {(x,y):x E€ AAy€ B}. This formula could be read: “A x B is the set of 
all ordered pairs (x,y) such that x is in A and y is in B.” Here is a formal definition of the 
two basic forms of set-builder notation. 


3.79. Definition: Set-Builder Notation. For all open sentences P(x), all sets 
B, and all objects z, |z € {x: P(x)}| iff | P(z) is true; and |z € {xe B: P(x)}| iff 


z € B and P(z) is true |. The letter x can be replaced by any unused letter. For instance, 


{x : P(x)} = {y : P(y)} = tt: PŒ). 


Table 3.1 illustrates set-builder notation by redefining the main set operations of this 
chapter using this notation. You must become familiar with this notation, as it is used 
universally in mathematics texts. You might wonder why I have avoided this notation so 
far in this text (and why I use it only sparingly in later chapters). The reason is this: 
in many years of teaching proofs, I have observed that set-builder notation is one of the 
biggest stumbling blocks for beginning students trying to write proofs about sets. In a proof 
involving set unions, say, one must fluidly transition back and forth between the statement 
“z € AU B” and the equivalent statement “z € A or z € B” — so I use this equivalence as 
the very definition of set unions. The set-builder definition AU B = {z:z E€ AVz € B} 
means exactly the same thing, but the extra level of translation needed to use this definition 
in a proof seems to cause endless trouble. 
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AUB = {a:cEAVareEB} 
ANB = {a:cEAAxreEe Bh 
A-B = {a:xnEAAc¢ Bl ={xeEA:u ¢ B} 
P(A) = {85:5 C A} 
AxB = {(a,y):tE€ AAYE Bh ={ze: dre A, Jy € Byz =(a,y)} 
(a,b) = {te R:a<a<bd} 
[a,b] = {we R:a<a<b} 
UA: = {e:HeI,2€ Aj} 
icI 
[Q4 = {r:VicI, sE Aj} 
icI 
TABLE 3.1 


Definitions of Set Operations in Set-Builder Notation. 


Section Summary 
1. Definitions. Suppose I # is an index set and, for each 7 € J, A; is a given set. 


General Union: For all z, | z € U A; | iff | di € I,z € Al. 
tel 


General Intersection: For all z, | z € N A; | iff | Vi € I,z € A.. 
tel 


2. Properties of General Unions and Intersections. The general union and intersec- 
tion operations satisfy properties such as monotonicity, de Morgan laws, and the 
distributive laws. Also, (),-, A; is the largest set contained in every A;, whereas 
Ue, Ai is the smallest set containing every A;. Replacing I by a larger index set 
J produces a larger union and a smaller intersection. We can simplify a union of 
unions or an intersection of intersections by combining the index sets. 


3. Set-Builder Notation. Informally, {x € B : P(x)} is the set of all objects x in the 
set B that make P(x) true. Formally, for all open sentences P(x), all sets B, and 
all objects z,|z € {x € B: P(x)} | iff] z € BA P(z) |. Notation such as {x : P(x)} 


and {(x,y) : P(x,y)} is defined similarly. The letter x can be replaced by any 
unused letter; e.g., {x : P(x)} = {t: P(t)}. 


Exercises 


1. Let I be the set of odd integers. For n € I, define A, = (n — 1,n +1). Sketch 
and describe the set R — Uper An- 


2. For each index set I and sets A; below, describe U A; and N Ai. 
ie€l icI 
Do not prove your answers; it can help to draw pictures in R and R?. 
(a) I = Z>o; for each n € I, An is the half-open interval (1 + 1/n,5 — 1/n]. 
(b) I = [0,3]; for each r € I, A, is the open interval (r — 2,r + 2). 
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12. 


13. 


14. 
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(c) I = Ryo; for each r € I, A, is the rectangle [0,r] x [0,1/r]. 

(d) I = Ryo; for each r € I, A, is the set of (x,y) with (x — r}? +y? < r?. 

Let a,b be fixed real numbers with a < b. (a) Express the closed interval [a,b] as 
an intersection of infinitely many open intervals. (b) Express the open interval 
(a,b) as a union of infinitely many closed intervals. 


In Example 3.73, carefully prove the claimed set equalities UJ; An = Q and 
Aes An =Z. 


n=1 


For each n € Zyo, let 
A, = {m E Z : m divides n} and B, = {k € Z : n divides k}. 


a) Compute Ur", An and NZ] An. 

b) Compute UP, Bn and NZ; Bn. 

c) Compute Yr, (An N Bn). 

a) Suppose Aj, Ag,...,An,... are sets such that An C An+1 for all n € Zyo. 
For i < j, what are the sets 2 An and Mi An? 

(b) Suppose B1, B2,..., Bn,... are sets such that B, D Bn41 for all n € Zso. 
For i < j, what are the sets Bs Bn and ‘Que Bat 


( 
( 
( 
( 


Prove part (a) of the Theorem on General Unions and Intersections (monotonic- 
ity). 

(a) Finish the proof of the de Morgan Law in Proof 3.76. 

(b) Prove the second de Morgan Law in part (b) of the Theorem on General 
Unions and Intersections. 


(a) Prove part (e) of the Theorem on General Unions and Intersections (greatest 
lower bound property). (b) Prove part (f) of the Theorem on General Unions and 
Intersections (least upper bound property). 


Prove the distributive laws for general unions and intersections (part (c) of the 
Theorem on General Unions and Intersections). 


Use set-builder notation to describe the following sets: 
(a) (—00, 3] (b) [a,b) (c) the set of odd integers (d) Q. 


Draw pictures of these sets described in set-builder notation: 

(a) {zrER:xz>3Vzr< -—3} 

(b) {(z,y) ERXR:0<y<4-r?} 

(c) {(,y,z) € R? : max(læ], |yl, 121) = 1} 

(d) {(x,y) € [0,3] x [0,3]: £ +y € Z} 

Give specific examples of sets B, C R (for each n € Z>o) such that Bn D Bn+1 
for all n and NÇ; Bn is: (a) the empty set; (b) {7}; (c) [-1,1]; (d) Z. 

The goal of this problem is to get practice using known theorems to help prove 
new theorems. Fill in all the blanks in the proof below; the reasons for each step 
should be appropriate parts of the Theorem on Subsets, the Theorem on Set 
Equality, and the Theorem on General Unions and Intersections (excluding the 
first identity in part (h), which we are proving). 


Theorem. For all I,J 40, Ai, Aj, (U a) U U A] = U Ak. 
icI jEJ kEIUS 
Proof. Fix arbitrary sets I, J # Ø, A;, and Aj. 
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18. 


19. 
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Part 1. We must prove (U a) U U A;j| Cc U Ak. 


icI jEJ keIUJ 
We know I C IUJ (by (a) from the Theorem on Subsets), and so 
Ucr Ai © Ukeruz Ak (by (b) from the Theorem on General Unions and 
Intersections). Similarly, we know (c) (by (d) from the Theorem on 
Subsets), and so Uje z Aj © Uneruy Ax (by __ (e) _ from the Theorem on Gen- 
eral Unions and Intersections). Combining these two facts with (£) from 
the Theorem on Subsets, part 1 is proved. 


Part 2. We must prove U Ak C (U a) U U Aj 


kEIUJ i€l jes 
By (g) from the Theorem on General Unions and Intersections, we may 
instead prove the equivalent statement: 


Vk €E TUS, Ag T (Ua) u| UA; 


icI jEJ 


Fix k € IU J, so we know k € I or k € J. To prove Ax C (U;ez Ai) U (Uses As): 
use cases. 
Case 1. Assume k € I. First, Ay C Uje, Ai by (h) 

Second, User Ai © (User A) U (jes As) by) 

Therefore, Ak C (Uier Aj) U (jes A;) by G). 
Case 2. Assume k € J. First, Ak C Ujez Aj by (k) 

Second, Uses Aj C (Uier Ai) U (Uje7 Ay) by (1) ; 

Therefore, Ak C (Uicr Aj) U (jes A;) by (m) . 
Recall that an indexed collection of sets {9; : i € I} is called pairwise disjoint iff 
for all i, j € I, if i Æ j, then SiN S; = 0. Suppose Aj, Ag,...,An,... are sets such 
that An C An+, for all n € Zo. Define By = A; and B, = An — Ay_ for all n € 
Zs1. Prove that {Bn : n € Z>o} is pairwise disjoint, and UL} Bn = UZ] An- 
Illustrate with a Venn diagram. [Hint: If x € JL; An, then there exists a least 
positive integer no with x € Ano. Show that x € B,,.] 
Let {An : n E€ Zso} be an indexed family of sets. Define Bı = A, and, for each 
n > 1, define Bn = A, -U A,. Prove that {Bn : n € Z>o} is pairwise disjoint, 
and Ur, Bn = UP, An. Illustrate A, and Bn for 1 < n < 4 in a Venn diagram. 
Let {An : n € Zyo} be an indexed family of sets. Define B, = U= A; for each 
n € Z>o. Prove that Bn C Bn41 for all n € Zyo, and UFL; Bh = UZ A 


n=17°n° 
Let {Cn : n € Zso} be an indexed family of subsets of a given set X. Define 
Dn = Nj- Cj for each n € Z>o. Prove that Dy 2 Dn4i for all n € Z>o, and 
NV, Dn = NZ; Cn. Do this by using the result of the previous exercise and 


other known theorems. [Hint: Consider A, = X — Cy] 
(a) Prove or disprove: for all 7 4 Ø and all sets A;, Bi, 


UJiainB) co | U4] 0 (U as). 


i€l jel kel 
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(b) Prove or disprove: for all J 4 Ø and all sets A;, Bi, 


UJ4inB)2 | U4] o (U as). 


i€l jel kel 


A subset U of R is called an open set iff Va € U,ar € Rso, (aw —r,x +r) CU. 

a) Prove: and R are open sets. 

b) Prove: every open interval (a,b) is an open set. 

c) Prove: for all U,V C R, if U and V are open sets, then U N V is an open set. 
d) Prove: for any indexed collection {U; : i € I} of subsets of R, if U; is an open 
set for all i € I, then (ez Ui is an open set. 

A subset C of R is called a closed set iff R — C is an open set (see Exercise 20). 
a) Prove: Ø and R are closed sets. 

b) Prove: every closed interval [a,b] is a closed set. 

c) Prove: Z is a closed set. 

d) Find (with proof) a specific subset of R that is neither an open set nor a 
closed set. 

e) Prove: the union of any two closed sets is a closed set. 

f) Prove: the intersection of any nonempty family of closed sets is a closed set. 


An open disk is a subset of R? of the form {(x,y) : (a — h)? + (y—k)? < r°}, 
where h,k,r € R are fixed real numbers with r > 0. Express each of the following 
subsets of R? as a union of an indexed collection of open disks. 

(a) R? (b) {(z,y) € R?: 2 >O0Ay <0} (c) (-1,1)xR 

(d) (0,5) x (2,4) (e) {(a,y) € R: y > Jal} 


Show that the intersection of two open disks in R? is either empty or is the union 
of an indexed collection of open disks. 


Let X be a fixed set, and let A be a collection of subsets of X with the following 
properties: (i) @ € A; (ii) for all S € A, X — S € A; (iii) given {Sn : n € Zyo} 
with every Sn € A, UP, Sn € A. Prove: (a) X € A; (b) for any {Sn : n € Zso} 
with every Sp € A, NZ] Sn € A; (c) for all S,T € A, SUT € A; (d) for all 
STEA, SATEA. 

Let X = {1,2,3}. (a) Give three different examples of collections A of subsets of 
X satisfying conditions (i) through (iii) in the previous exercise. (b) How many 
such collections A are there for this set X? 
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3.7 Axiomatic Set Theory (Optional) 


So far in this chapter, we have defined several operations on sets and learned how to prove 
theorems involving these set operations. However, there is a subtle logical omission in our 
presentation of set theory so far. In many instances, we have introduced a new set (such as 
the union AUB, or the intersection ANB, or the power set P(A)) without giving any formal 
justification for why the new set actually exists. Such justifications may seem unnecessary, 
but (as we will see below) if we indiscriminately assume the existence of certain sets, logical 
paradoxes can result. 

A completely rigorous logical development of set theory (as opposed to the semi-formal 
treatment given in earlier sections) is called axiomatic set theory. In axiomatic set theory, 
we must use explicitly stated axioms to justify the existence (and uniqueness) of each set 
under consideration. This optional section gives a sketch of the framework and axioms 
for one version of axiomatic set theory called the ZFC (Zermelo—Fraenkel—Choice) system. 
More detailed introductions to axiomatic set theory can be found in the texts listed in the 
Suggestions for Further Reading at the end of the book. 


Undefined Concepts 


Axiomatic set theory begins with two undefined notions: the idea of a set, and the concept 
of membership in a set. One aspect of the theory that may seem unusual at first is that all 
objects under consideration are sets. You might think that we would need to allow other 
types of mathematical objects such as numbers, functions, geometric figures, and so forth. 
Remarkably, all of these objects can be defined as certain kinds of sets. It follows that all 
variables appearing in the axioms and formulas below represent sets. As above, we use € 
as a symbol for the undefined concept of set membership. Thus, for any two objects (sets) 
x and y, exactly one of the following alternatives holds: x € y (“x is a member of y”), or 
x Z y (“x is not a member of y”). You must learn not to be distracted by the fact that a 
lowercase letter is used for the set y. We could have equally well used uppercase letters for 
both these sets, or a mixture of lowercase and uppercase letters. 


Russell’s Paradox 


Our intuition tells us that, for any logical property, there should exist a set whose members 
are precisely the objects that satisfy the given property. Formalizing this intuition, we are 
led to propose the following axiom. 

3.80. Axiom of Abstraction (Tentative). Let P(z) be a statement with a free variable 


z, where the variable x does not occur in P(z). Axiom: | dz, Vz, (z € x) = P(z). 


This axiom is really a family of axioms, one for each statement P(z). The members 

of the set x in the axiom are precisely those objects (sets) z for which P(z) is true. In 
set-builder notation, we could write x = {z : P(z)} (although use of this notation suggests 
the set x is unique, which we have not proved yet). 
3.81. Example. Let P(z) be the statement “z 4 z.” By the Axiom of Abstraction, there 
is a set x such that for all z, z € x iff z # z. By properties of logical equality, z Æ z is false 
for any z. It follows that for all z, z ¢ x. So, we have used the axiom to prove the existence 
of an empty set x. 


3.82. Example. On the other hand, letting P(z) be the statement “z = z,” which holds 
for every z, we obtain a set x such that z € x for all sets z. This set x is a universal set 
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that contains all objects (sets) in the universe being studied. In particular, since x itself is 
a set, we have x € x (x is a member of itself). 


3.83. Example. Now let P(z) be the statement “z ¢ z.” The Axiom of Abstraction asserts 
that there is a set x for which the following statement is true: 


Vz, (z E€ x) (z € 2). 


In words, x is the set of all sets z such that z is not a member of itself. In the universal 
statement just written, we can choose z to be the particular set x, thereby obtaining the 
proposition 


But this proposition is false, since the statements on the two sides of the biconditional 
symbol must have opposite truth values. So, the Axiom of Abstraction has led us to a 
contradiction! 


The phenomenon in the last example is called Russell’s Paradox (named after its dis- 
coverer, Bertrand Russell). The only way to avoid this paradox is to give up the Axiom of 
Abstraction, despite its initial intuitive appeal, and replace it with more delicate existence 
axioms for sets. We describe these axioms below. 


The Axiom of Extension 


We now state the axioms of the ZFC theory. For each axiom, we give a brief discussion of 
the intuitive meaning of the axiom and how it is used. 


3.84. Axiom of Extension. | Vz, Vy, |z = y © (Vz,zEuezey)j. 


Intuitively, this axiom indicates the relationship between the logical equality of sets 
and the undefined concept of set membership. The axiom says that two sets x and y are 
equal iff they have exactly the same members. Recall that we defined x C y to mean 
Vz,z E€ £x = z € y. We also saw earlier that an equivalent way to formulate the Axiom of 
Extension is to say that x = y iff x C y and y C x. The Axiom of Extension is often used 
to verify the uniqueness of a newly constructed set, as we will see in examples below. 

We should mention here that the axioms of set theory are overlaid on top of certain 
logical axioms, which we have not listed explicitly. Some of these logical axioms describe 
the properties of the equality symbol =. For example, we might have axioms asserting that 
equality is reflexive, symmetric, and transitive (although we have seen that these properties 
can be proved from the Axiom of Extension). Some other axioms are needed to ensure that 
equal objects can be substituted for one another wherever they appear in a statement. In 
the context of set theory, it suffices to adopt these two Substitution Axioms: 


Yr, Vy, [£ = y > (VYz,zr E z Sy € z)|; 


Yr, Vy, [£ = y > Yz,z Ez & 2z €y). 


With these logical axioms already in place, the new content provided by the Axiom of 
Extension is the converse of the second Substitution Axiom: 


Yr, Vy, (Yz, z Ezr &zEy)>xz=yl]. 


Additional substitution properties involving defined concepts such as union and intersection 
can then be proved as theorems (see part (d) of the Theorem on Set Equality). 
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The Axiom of Specification 


Our first set construction axiom lets us create subsets of given sets, if we can state a 
condition that determines which objects belong to the subset. 


3.85. Axiom of Specification. Let P(z) be a statement with a free variable z, in which 
the variable x does not occur. Axiom: | Vu, 3x, Vz, |z € x & (z € uA P(z))). 


Intuitively, given any logical property P(z) and given a particular set u, we can form 
a new set x consisting of all objects (sets) belonging to u that have the property P(z). 
In set-builder notation, we could write x = {z € U : P(z)}. Note how closely this axiom 
resembles the forbidden Axiom of Abstraction. The key difference is that we must already 
have a set u given in advance, and then we use the property P(z) to select or “specify” 
which elements should belong to the new set x (which is automatically a subset of u). 


3.86. Example: Binary Intersections. Let us use this axiom to justify the existence of 
the intersection of two sets. Earlier, we “defined” AN B by saying that for all z, z € ANB 
iff z € A and z € B. To obtain this set by invoking the Axiom of Specification, we let P(z) 
be the statement “z € B” and we let u be the set A. The axiom tells us there is a set x 
such that for all z, z € x iff z € A and z € B. We define this set to be the intersection of 
A and B, denoted AN B. Before introducing this notation for the intersection, we should 
also check that the set x (whose existence is guaranteed by the Axiom of Specification) is 
uniquely determined by the given sets A and B. This follows from the Axiom of Extension. 
For suppose x’ is another set such that for all z, z € a’ iff z € A and z € B. We deduce 
that for all z, z € a iff z € a’, hence g = 2’. 


Similarly, whenever we use the Axiom of Specification to create a new set x, this set 
is uniquely determined by P(z) and u. For if another set x’ satisfies the conditions in the 
axiom, then for all z, z € a’ iff (z € u and P(z)) iff z € x. So z = a’ by the Axiom of 
Extension. It is now safe to use the notation x = {z € u: P(z)} to denote the set of all z 
such that z € u and P(z). On the other hand, the related notation {z : P(z)} must not be 
used. 


3.87. Example: Set Difference. Given any sets A and B, we previously “defined” A—B 
by declaring that for all z, z € A— Biff z € A and z ¢ B. The Axioms of Specification and 
Extension justify this definition by proving the existence and uniqueness of the set denoted 
A — B. This follows by writing A — B= {z € A: z ¢ B}. 


Let us see what happens if we try to recreate Russell’s Paradox using the Axiom of 
Specification. Let u be a fixed set, and take P(z) to be the statement “z ¢ z.” Our axiom 
provides a set x satisfying 

Vz, [z Exz&(zEuAnzg z). 


Taking z = x here leads to the statement 
LETS (LEUNATE T). 


If x € u, the statement just written must be false, as in the original discussion of Russell’s 
Paradox. But now we can escape the contradiction by concluding that x ¢ u. Since u was 
a perfectly arbitrary set, we have converted Russell’s Paradox into the following theorem. 


3.88. No-Universe Theorem. For every set u, there exists a set x with x ¢ u. 


This theorem tells us that, in the ZFC theory, there does not exist a “universal set” 
that contains every set. Note also that if such a set u did exist, the Axiom of Specification 
(using this u) would reduce to the Axiom of Abstraction. 


Sets 151 


The Axioms of Pairs, Power Sets, and Unions 


Our next three axioms allow us to combine several sets to produce a larger set. 


3.89. Axiom of Pairs. | Va, Vb, Jz, Yz, |z € x & (z =aVz = b)]. 


Informally, given any two sets (objects) a and b, there exists a set x whose members are 
precisely a and b. The Axiom of Extension shows that this set x is uniquely determined by a 
and b. So we can introduce the notation x = {a,b}; as before, we call this set the unordered 
pair with members a and b. Observe that we can take a = b here, in which case we obtain 
the singleton set {a} whose only member is a. By using the Axiom of Pairs on the sets {a} 
and {a,b}, we obtain the set {{a}, {a,b}}, which was our formal definition of the ordered 
pair (a,b) in Definition 3.66. In §3.5, we used this definition to prove that for all objects 
(sets) a,b, c,d, (a,b) = (c,d) iff a = c and b = d. 


3.90. Axiom of Power Sets. |Va,4z,Vz,|zeErezCal. 


Informally, given any set a, there exists a set x such that the members of x are precisely 
the subsets of a. By the Axiom of Extension, the set x is uniquely determined by this 
condition. As before, we write x = P(a) and call x the power set of a. 


3.91. Axiom of Unions. | Va, 3z, Yz, |z E€ x & ds € a,z € s]. 


Informally, given any set a, there exists a set x such that the members of x are precisely 
the members of the members of a. By the Axiom of Extension, the set x is uniquely de- 


termined by this condition. We write | x = Ja] and call x the union of the sets in a. The 


following examples may clarify what is going on here. 


3.92. Example. Suppose we have a set a = {b,c,d,e}. Applying the Axiom of Unions to 
this set, we get a set x = | Ja such that for all z, z € x iff z is a member of some member of 
a. As the members of a are b, c, d, and e, we see that z € x iff z € bor z €corz€dorz€e. 
Using informal notation from earlier in the chapter, this says that z € x iff z € bUcUdUe. 


3.93. Example: Binary Unions. Given any two sets B and C, define BUC = U{B, C}. 
In more detail, we use the Axiom of Pairs to obtain the unordered pair a = {B,C}, and 
then use the Axiom of Unions to obtain the set Ja. As in the preceding example, it follows 
that for all z, z € BUC iff z € B or z € C. Iterating this construction, we can define the 
union of three sets by letting BUCUD = (BUC)UD. Similarly, we can define the union of 
four sets, five sets, or any specific finite number of sets by repeated use of the binary union 
operation. 


3.94. Example. Suppose a = {Bo, Bi, Bo,...,Br,...} = {Bk : k € Zso}. (This is an 
informal example, since we have not formally discussed Z>ọ yet or defined what is meant 
by {Bp : k € Z>0}.) Here, x = Ua consists of all z such that z € Bk for some k € Zs. 
This is the set that we denoted earlier by Up» Bp- 


3.95. Example: Product Sets. Given any two sets A and B, we can now justify the 
definition of the product set A x B, which we defined earlier by the condition 


Vz,[2€ Ax BS dae A,db € B,z = (a,b)]. 


By the Axiom of Extension, there is at most one set satisfying this condition. But, how can 
we prove that A x B exists? To begin, recall that (a,b) = {{a}, {a,b}}. Given a € A and 
b € B, we know {a} and {a,b} are subsets of AU B, hence {a} and {a,b} are members of 
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P(AU B). Then (a,b), which is a set of two members of P(AU B), is a subset of P(AU B) 
and hence a member of P(P(AU B)). Finally, A x B is a set of ordered pairs, each of which 
is a member of P(P(A U B)), so that A x B is a certain subset of P(P(A U B)). To be 
precise, 


Ax B={2z€P(P(AUB)): dae A,lb € B,z = {{a}, {a, b}} }. 


We have seen that AU B exists using the Axiom of Pairs and the Axiom of Unions. Using 
the Axiom of Power Sets twice, we see that P(P(A U B)) exists. Finally, the displayed 
formula and the Axiom of Specification shows that the set A x B exists. We can iterate 
this construction to define the product of three sets, of four sets, and so on; for instance, 
AxBxC=(Ax B)xC. 


The Axioms of () and Infinity 


Based on the axioms stated so far, it is impossible to prove that there exists any set what- 
soever! The reason is that each set-construction axiom can only be invoked if we already 
have at least one set that is already known to exist (such as the set u in the Axiom of Spec- 
ification). The following axiom gets us started by postulating the existence of the empty 
set. 


3.96. Axiom of the Empty Set. |4z,Vz,z¢ x. 


Informally, there is a set x having no members. By the Axiom of Extension, if another 
set x’ satisfies Vz, z ¢ x', then x = x’. So we can denote the unique set x with no members 
by 0, and we can call x the empty set. 

Once we have the empty set, we can start applying the other set-construction axioms 
to build more and more new sets. Intuitively, we can get infinitely many distinct sets in 
this manner, but we claim that each individual set that we build is still finite. Reasoning 
informally (as we have not yet rigorously defined “finite”), the preceding claim holds since 
we can only obtain a given set by finitely many applications of the axioms, and each axiom 
stated so far produces a finite output set when applied to finite input sets. Thus, with the 
apparatus developed so far, there is no way to justify the existence of an infinite set such 
as Z or R. The next axiom resolves this difficulty. 


3.97. Axiom of Infinity. | 3x, [0 € x A Yz, (z E€ x > zU {z} € x)|. 


Although it may not be evident from an initial reading of the axiom, this axiom guar- 
antees the existence of an infinite set x. In fact, the implication appearing in the axiom will 
assure that all the natural numbers {0,1,2,...} (to be defined later as certain sets) appear 
as members of x. To give a hint of what is going on here, we mention that the natural 
number 0 is defined to be the empty set Ø, and for a natural number z (which is a certain 
type of set), z + 1 is defined to be the set z U {z}. Then the axiom is saying (among other 
things) that 0 € x, and for any natural number z € x, z + 1 also belongs to x. It should 
then be intuitively believable that Zs) C x (compare to the discussion of mathematical 
induction in the next chapter), so that the set x is infinite. 


The Axiom of Replacement 


Although our discussion of the Axiom of Infinity and Zs is not yet complete, let us suppose 
for a moment that we have constructed the set Zs9 = {0,1,2,3,...} within axiomatic set 
theory. Returning to a previous example, we could now hope to give meaning to the indexed 
union ther By, where Bo, B1, Bo,... are sets indexed by the nonnegative integers. We would 
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like to use the Axiom of Unions to define Up) Be = U{ Bx : k € Zo}. In order to do this, 
we need to justify the existence of the indexed set {Bp : k € Z>0o}. We can use the next 
axiom to accomplish this task. 


3.98. Axiom of Replacement. Let X be a given set, and let P(x, z) be a given statement 
with free variables x and z. Suppose that for each x € X, there exists a set y (depending on 
x) such that for all z, z € y iff P(x, z). By the Axiom of Extension, y is uniquely determined 
by x, so we denote y as y(x). Then there exists a set Y such that for all w, w € Y iff there 
exists x € X with w = y(x). 


Intuitively, the purpose of this axiom is to let us apply a “function” to all members of 
a given set X to create a new set Y. In this case, a “function” is not a set of ordered pairs 
(as discussed in later chapters), but rather a logical statement that tells us how each input 
set x € X is related to the corresponding output set y € Y. In more detail, our “function” 
F with domain X is defined by letting F(x) = y(x) = {z : P(x, z) is true} for each z € X. 
Part of the hypothesis of the Axiom of Replacement is that each output y(x), as x varies 
over X, is already known to be a set. Given that this hypothesis holds, the axiom asserts 
the existence of a new set Y = {F (x): x € X} = {y(x): x € X}, which is obtained by 
“applying F to each member of X” and collecting together all the resulting outputs into a 
new set. In other words, we obtain a new set Y from the set X by replacing each x € X by 
the set F(x). 


3.99. Example. This example assumes we have already constructed the sets Z>o and R 
within axiomatic set theory. Let P(x,z) be the statement (x € Z>o) A (z € R) A (—2a < 
z < x). By the Axiom of Specification, for each z € Z>o, the set y(x) = {z : P(x,z)} = 
{z E€ R: —2xr < z < xr} exists. So, the Axiom of Replacement transforms the known set 
Z>o into the set of closed intervals Y = {[—2x, a] : x € Z>o}. We should point out that, for 
this particular set Y, we can construct Y from other axioms without invoking the Axiom 
of Replacement. 


Generalizing the last example, if we have an index set J and a formula that characterizes 
a unique set B; for each i € I, then the Axiom of Replacement converts the index set I into 
the indexed set {B; : i € I}. Then, for instance, we could use the Axiom of Unions to obtain 
the set U;<, Bi- Also, provided I is nonempty, we could use the Axiom of Specification to 
obtain (jez Bj. 


The Axiom of Choice 


Suppose X is a set of nonempty sets. Then each x € X is a nonempty set, so there exists 
at least one object (set) y with y € x. The y appearing here depends on x, and y is usually 
not unique. In many parts of mathematics, it is technically helpful to have access to a set 
G of ordered pairs with the property that for each x € X, there is exactly one ordered pair 
(x,y) € G with first component x, and for all (x,y) € G we have y € x. In order to prove 
the existence of such a set G, we need a new axiom, called the Axiom of Choice. The name 
arises because the set G represents a simultaneous choice of one element y from each set x 
in the given collection of sets X. 


3.100. Axiom of Choice. | VX, [0 Z X > JF, Yx € X, 3!y € x, (x,y) € F]. 


The set F provided by the axiom may have “too many” elements. For example, F could 
have members that are not ordered pairs, or members of the form (a,b) with a ¢ X, or 
members (x, z) with « € X and z ¢ x. However, using the Axiom of Specification, we can 
shrink F to a set G having the properties stated above (see Exercise 15). 
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The Axiom of Choice may seem intuitively obvious, since we can readily imagine forming 
a collection of ordered pairs (x, y) by considering each nonempty set x in turn and arbitrarily 
choosing some member y € x. However, being intuitively convinced that a set ought to exist 
is not the same as being able to prove that such a set exists based on explicitly stated axioms. 
In some special situations, we can avoid the Axiom of Choice, by using a specific choice rule 
(in the form of a logical statement) together with the Axiom of Specification to build the 
set of ordered pairs G. For example, if every x € X is a nonempty subset of Z>9, we could 
define 


G = {z € X x Z>o : Ix € X, Jy E Z>0, z = (x,y) Aye 1 AYw E x,y < w}. 


This definition works because of the fact (discussed in more detail later) that every 
nonempty subset of Z>o has a unique least element. 

It is a famous and difficult result of axiomatic set theory that the Axiom of Choice 
is independent of the other axioms in the ZFC theory, assuming that those axioms are 
themselves consistent. This means that we could add either the Axiom of Choice, or the 
negation of this axiom, to the other axioms without introducing new contradictions. In 
particular, it is impossible to prove the Axiom of Choice (or its negation) as a consequence 
of the other axioms, unless the other axioms are inconsistent (in which case every statement 
could be proved from those axioms, and the entire theory is worthless). 

There are many logically equivalent versions of the Axiom of Choice, some of which are 
discussed later in this text. Two such versions are: the product of any family of nonempty 
sets is nonempty; and for any set X, there exists a well-ordering < on X. Many key theorems 
in various areas of mathematics require the Axiom of Choice in their proofs. Some of these 
theorems include: the union of countably many countable sets if countable (proved in §7.3 
when we discuss cardinality); every vector space has a basis (proved in linear algebra); the 
product of any collection of compact topological spaces is compact (proved in topology); 
every commutative ring with identity has a maximal ideal (proved in abstract algebra); 
every consistent set of sentences has a model (proved in formal logic); and not every subset 
of R is a Lebesgue-measurable set (proved in real analysis). Most mathematicians today 
accept the Axiom of Choice in order to have access to these and many other theorems that 
depend on this axiom. 


The Axiom of Foundation 


The final axiom of the ZFC theory is the following odd-looking statement. 


3.101. Axiom of Foundation. | Va, [a 4 0 > (3b,b € a^a Nb = 4). 


In words, for any nonempty set a, a has a member b that has empty intersection with 
a. The hypothesis a Æ Ø is not surprising, since the empty set does not have any members. 
Next we give some intuition for what the conclusion means. 

We begin with an informal example involving the set Z>ọ = {0,1,2,3,...}. It is intu- 
itively evident that there does not exist an infinite strictly decreasing sequence of nonneg- 
ative integers 

Qo > a, > ag > > ap >ce:. 


(We discuss this point in more detail in later chapters.) Roughly speaking, the goal of the 
Axiom of Foundation is to rule out analogous strictly decreasing sequences in the world of 
sets. In this setting, the concept b < c for integers is replaced by the analogous concept 
x E y for sets. So, we are trying to rule out the existence of an infinite sequence of sets 
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Xo, £1, T2, %3,... Such that --- € £3 € £2 € £1 E x, i.e., such that £k+1 € £p for all k € Z>o. 
As special cases of this, we want to forbid a set x such that x € x (which would lead to the 
infinite sequence -x E€ x E x E€ x € x), as well as a pair of sets x,y such that x € y and 
y € x (which would lead to the sequence ---x Eye xe yeu ey), and so on. 

Thus, although the Axiom of Foundation is phrased as an existence statement, its true 
goal is to force the non-existence of certain pathological combinations of set membership 
relationships. We now sketch how this goal is achieved. Suppose (to get a contradiction) 
that there did exist specific sets x, with 7,4; € £p for all k € Zso. By the Axiom of 
Replacement, we can form the set a = {£p : k € Zso}. The set a is nonempty, since 
zo € a, so the Axiom of Foundation provides a set b € a with a N b = Ø. Now, b = xx, for 
some k € Z>o. We deduce rp41 E€ b and x41 € a, so that £k+1 E€ AND = Ø, which is a 
contradiction. 

The Axiom of Foundation is also called the Axiom of Regularity. 


Section Summary 


All of mathematics can be formalized within set theory. One development of set theory is 
based on the following axioms. 


3.102. Axioms of ZFC Set Theory. 
(a) Axiom of Extension: Vx,Vy, |x = y & (Yz,z E£ & z €y)]. 


(b) Axiom of Specification: For all open sentences P(z), 
Vu, da, Vz, |z € x & (z € u A P(z))]. 


c) Axiom of Pairs: Va, Yb, 3x, Yz, |z € £ & (z =aVz=b)]. 


d) Aziom of Power Sets: Va, 3x, Yz, |z E £ & z Cal. 


Axiom of Unions: Va, 3x, Yz, |z E€ £z & Js € a, z E€ s]. 
Axiom of the Empty Set: 3x, Vz, z x. 
g) Aziom of Infinity: Ix, [Ø € x A Yz, (z E€ x => zU {2z} € x)]. 


h) Aziom of Replacement: For all open sentences P(x, z) and all sets X, 
Va € X, Jy, Vz, (z E€ y & P(a,z))] > SY, Yw, [w € Y © Jx € X, Yz, (z € w & P(x, z))]. 


(i) Axiom of Choice: YX, [0 Z X > JF, Yx € X,4!y € x, (x,y) € F]. 
(j) Axiom of Foundation: Ya, [a 4 0 => (3b,b € a ^aNb = Ø)]. 


These axioms can be used to prove the existence and uniqueness of sets such as 0, Z>o, 
AUB, ANB, A-B, P(A), Ax B, User Ais Nez Ai, and so on. The No-Universe Theorem 
states that for every set u, there exists some set x Z u. The Axiom of Foundation rules out 
infinite descending chains of sets belonging to other sets: i.e., there do not exist sets x, (for 
ke Z>0) with £k+1 € £p for all k > 0. 


Exercises 


1. Prove or disprove: Va, Vy, [Vz,vEzeayez]>u=y. 


2. Use the axioms to prove: for all sets a,b,c, there exists a unique set x such that 
for all z, z € x iff z = a or z = b or z = c. (The set x is the unordered triple 


{a, b, c}.) 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


Tt 


18. 
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Use the axioms to prove: for all sets a,b,c,d,e, there exists a unique set x such 
that for all z, z € x iffz=aorz=borz=corz=dorz=e. 


Assuming that R and the order relation < on R have already been constructed, 
use the axioms to prove the existence and uniqueness of the intervals (a, b), [a, b], 
(a, b], and [a,b) for all real numbers a < b. 


Assuming that Z and the arithmetic operations and order relation on Z have 
already been constructed, use the axioms to prove the existence and uniqueness 
of the following subsets of Z: (a) Z<o; (b) the set of odd integers; (c) the set 
of prime integers (these are integers p > 1 such that the only positive integers 
dividing p are 1 and p). 


Assuming that R and Z5o have already been constructed, prove the existence of 
the set Y = {[-22, 2]: x E Zso} without using the Axiom of Replacement. 


Show that the Axiom of the Empty Set follows from the alternate axiom Jg, x = x 
and the other axioms of ZFC set theory. 


(a) Show that the Axiom of Pairs (as stated in this section) follows from the 
alternate version Va, Vb, dx, Vz, |(z = a V z = b) => z € a] along with the other 
axioms of ZFC set theory. (b) State and prove similar results for the Axiom of 
Power Sets and the Axiom of Unions. 


Compute: (a) UO (b) U{Ø} (c) UtO, {0}, {0} 3} (d) UP(S) (e) UA x B). 
For A # Q, informally define the set (] A by the condition: for all objects z, 
z € NA iff for all y € A, z € y. (a) Use the axioms to justify the existence and 
uniqueness of the set (N A. (b) In the case A = {B,C}, prove that NA = BNC. 
(c) Explain carefully why our informal definition cannot be used to define the set 
()9. In contrast, what is {0}? 


Prove or disprove each variation of the Axiom of Foundation. 

(a) Va, Ib,bE€ a ^aNb= f. 

b) Va, [a # 0 => 3b, a N b = 0). 

c) Va, [a #0 > 3b,bCa^anNb= f]. 

a) Use the Axiom of Foundation to prove Yz, x ¢ x without referencing Z>o. 

b) Similarly, prove ~x, Jy, (x E y A y E x). 

Suppose we try to define ordered pairs by letting (a,b) = {a, {a, b}} for all objects 
a and b. Prove or disprove the Ordered Pair Axiom using this definition of (a, b). 


Suppose we define the ordered pair (a,b) to be the set {{0, a}, {{0}, b}} (compare 
to Exercise 21 in §3.5). (a) Use the axioms to justify the existence and uniqueness 
of (a,b) for all sets a and b. (b) Use the axioms to justify the existence and 
uniqueness of A x B for all sets A and B. 


Sa oO 


In the discussion following the statement of the Axiom of Choice, show how to 
use the Axiom of Specification to convert the set F in the axiom to the set G 
described prior to the axiom. 


Rewrite the Axiom of Infinity without using the defined symbols 9, U, and {z}; 
use only logical operators, variables, =, and €. 


Show how the Axiom of the Empty Set can be deduced from the Axiom of Infinity 
and other axioms. 


Define 0 = 0, 1 = {0}, and 2 = {0,1}. Prove the existence of the set P({0, 1, 2}) 
without using the Axiom of Power Sets. 
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19. (a) Without using the Axiom of Pairs, prove the existence of a set S such that 
there exist exactly two objects a with a € S. (b) Use (a) to show that the Axiom 
of Pairs can be proved from the Axiom of Replacement. 

20. Consider a universe in which there exists only one object (denoted 0), and suppose 


0 € 0 is false. Assuming all quantified variables range over only the objects in 
this universe, which of the axioms of ZFC set theory are true? 


Taylor & Francis 
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Integers 


4.1 Recursive Definitions and Proofs by Induction 


We turn now from set theory to a closer examination of facts involving integers, including 
proofs by induction, integer division with remainder, prime numbers, and the theorem that 
every integer has a unique factorization into a product of primes. Proofs by induction are 
often needed when dealing with concepts defined recursively. So we begin by discussing 
recursive definitions. 


Recursive Definitions 


Let n be a fixed positive integer, and let x,a1,a2,...,@, be fixed real numbers. Consider 
the following informal definitions. 

(a) Factorials: Define n! =n-(n—1)-(n—2)-...-3-2-1. 

(b) Summation Notation: Define Xp] ak = a1 + a2 +++: +n. 

(c) Product Notation: Define Ik- ak = Q1 Q2: ...* Gn. 

(d) Exponent Notation: Define x” =x-x-...- x, where the right side has n copies of x. 


These definitions may seem perfectly legitimate, but they all contain a subtle flaw. Each 
definition uses the ellipsis symbol “...” to represent missing terms in the middle of each 
formula. The reader is supposed to know what terms are intended here by following the 
pattern of the terms that are present. But, in a formal development of these concepts, we 
cannot omit information in this way. One solution is to rephrase these definitions recursively. 
We introduce the idea of a recursive definition with the following formal definition of n!. 


4.1. Recursive Definition of Factorials. Define | 0! = 1 |. 


For each n € Zso, recursively define | (n + 1)! = (n+1)-n!}. 


This recursive definition begins with an initial condition, which provides an ordinary 
(non-recursive) definition for 0!. We are told that 0! = 1. Next, the recursive part of the 
definition gives us a formula for computing (n+1)! that assumes we already know the value of 
n!. For instance, taking n = 0 in the recursive formula, we see that 1! = (0+1)! = (0+1)-0! = 
1-1= 1. Next, taking n = 1 in the recursive formula, we get 2! = (1 + 1)! = (1 + 1)- 1! = 
2-1 = 2. Continuing similarly, we see that 3! = 3-2! = 3 -2 = 6, 4! = 4 - 3! =4-6 = 24, 
5! = 5-4! = 5-24 = 120, and so on. We can also iterate the recursive definition to recover 
the original informal definition; for instance, 


41=4.3!=4-3-2!=4-3-2-1!=4-3-2-1-0!=4-3-2-1-1. 


A basic recursive definition introduces a sequence of quantities f(0), f(1),...,f(m),... 
by specifying what f(0) is, and then telling us (for each n > 0) how to compute f(n + 1) 
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if we already know the value of f(n). In a more elaborate recursive definition, f(n + 1) 
might depend on any of the values f(0), f(1),...,f(m) preceding it in the sequence, not 
just f(n). The sequence might start at n = 1 instead of n = 0, or at any other fixed integer. 
We illustrate with a few more recursive definitions. As always, memorize all of these 
definitions! 


4.2. Definition: Exponent Notation. Let x be a fixed real number. 
Define | 7° = 1 |. For all n € Zso, define |z” +! = z” -x |. 


For example, x? = 2?-a2=a!-2-a2=29-e-x-"2=1-4-"-2=2-2-2, as expected. 


4.3. Definition: Summation Notation. Suppose we are given a real number a,, for each 


1 n+1 n 
n € Z>,. We define Se ap = a, |. For all n € Z>1, define oo ak = ( a) + Gn+1 | 
k=1 k=1 k=1 


Intuitively, the sum of the first n + 1 terms is found by taking the sum of the first n 
terms and adding a,4,. We can introduce variations of this definition for sums starting at 
index zero, such as 2e ak, or sums starting at any other integer. By convention, for all 
integers j < i, we let $% _;ak = 0. The summation index k can be replaced by any other 
letter that does not already have a meaning. For instance, Jz; ak = do", @j = J] as. 


j=l 
But, we could not write yy an, Since n is already being used as the upper limit of the 
sum. 


4.4. Definition: Product Notation. Suppose we are given a real number a, for each 


1 n+1 n 
n € Z> 1. We define II ay = a; |. For all n € Z>1, define II ak = ( a] -An+1 | 
k=1 k=1 k=1 


We could have defined powers and factorials using product notation. For any real x and 
positive integer n, you can check that x” = J J~; x and n! = JJ} j. 


The Induction Axiom 


In §2.1, we listed axioms describing some algebraic properties of Z (the set of integers). To 
prove facts about recursively defined concepts, we need a new axiom giving a fundamental 
property of the set Z>; = {1,2,3,...} of positive integers. 


4.5. Induction Axiom for Z>1. 
For all open sentences P(n), the following two statements are equivalent: 


(a) Yn € Z>1, P(n). 
(b) P(1) AVn € Zs1, (P(n) > P(n +1). 


Recall that the axioms of a mathematical theory do not need to be proved. Nevertheless, 
in the case of the Induction Axiom, we can provide an intuitive justification for the axiom 
that helps us understand what the axiom is saying. Let P(n) be a fixed open sentence. On 
one hand, it is a routine exercise using the proof templates to prove that statement (a) in 
the Induction Axiom implies statement (b). The real content of the axiom is the assertion 
that statement (b) implies statement (a). Suppose we know that statement (b) is true; why 
should we believe that statement (a) must follow? By using the Inference Rule for ALL 
(taking n = 1,2,3,... in statement (b)), we see that statement (b) contains the following 
information: 
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(1) P(1) is true. 
(2) If P(1) is true, then P(2) is true. 
(3) If P(2) is true, then P(3) is true. 
(4) If P(3) is true, then P(4) is true. 
(5) If P(4) is true, then P(5) is true. 
... and soon... 


Now, by (1) and (2) and the Inference Rule for IF, we deduce that P(2) is true. It then 
follows from (3) and the Inference Rule for IF that P(3) is true. It then follows from (4) 
and the Inference Rule for IF that P(4) is true. It then follows from (5) and the Inference 
Rule for IF that P(5) is true. Continuing this chain of reasoning forever, we see that P(n) 
is indeed true for every positive integer n, as asserted in statement (a). 

This argument can be informally visualized as follows. We imagine setting up an infinite 
row of dominos standing on edge, with one domino for each positive integer n. 


G@; aes 


Let P(n) be the statement “domino n falls down.” Statement (b) in the Induction Axiom 
says that domino 1 falls down; and for every positive integer n, if domino n falls down 
then domino n + 1 falls down. The axiom itself tells us that this information is enough to 
conclude that all the dominos fall down. The argument in the previous paragraph spells out 
in more detail how the cascade of falling dominos occurs. First domino 1 falls down, which 
causes domino 2 to fall, which causes domino 8 to fall, and so on. 

We hope that this intuitive argument convinces the reader of the truth of the Induction 
Axiom. The argument is not a proof of the Induction Axiom, however. This is because 
we would need to use the Inference Rule for IF infinitely many times to finish proving the 
universal statement Yn € Zs, P(n), and proofs are required to be finite in length. In a 
similar vein, we can think of the Induction Axiom as indirectly characterizing what the 
positive integers are: they are precisely the objects we get by starting with 1 and then 
adding 1 again and again, forever. 

Now that we have intuition for why the Induction Axiom is true, let us see how to 
use this axiom in proofs. The key idea is that if we need to prove statement (a) in the 
axiom (a universal statement about all positive integers), it is sufficient to instead prove the 
equivalent statement (b). We can use the proof templates for AND, V, and IF to generate 
the outline of a proof of statement (b). This leads to the following template for ordinary 
induction proofs, which should be carefully memorized. 


4.6. Template for Ordinary Induction Proofs. To prove Yn € Zs1, P(n): 

Say “We use induction on n.” 

Part 1. Prove P(1). 

Part 2. Fix an arbitrary integer no € Z>1. Assume P(no) is true. Prove P(no +1) is true. 


The two parts of this template arise since statement (b) in the Induction Axiom is 
an AND-statement. In part 2, we prove Vn € Zs1,(P(n) > P(n + 1)) by combining the 
generic-element proof template with the direct proof template for IF-statements. Part 1 of 
an induction proof is called the base case. Part 2 is called the induction step. The assumption 
P(no) in part 2 is called the induction hypothesis. 
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Returning to statement (a) and statement (b) in the Induction Axiom, we can now see 
that statement (b) will likely be easier to prove than statement (a), even though state- 
ment (b) appears to be more complex. The reason is that we are allowed to assume certain 
information in the course of proving statement (b) (namely, the induction hypothesis P(no)) 
to help us prove the target statement P(no + 1). We write no here to emphasize that no is 
a fixed integer (though in many proofs below, we drop the subscript to reduce clutter). We 
are not using circular reasoning when using P(no) to prove P(no + 1), since no and no + 1 
are different integers, and hence P(ng) and P(no + 1) are different statements. 


Examples of Induction Proofs 


We now give some examples illustrating the template for proofs by induction. Our first 
result states that the sum of the first n odd positive integers is n?. 


n 


4.7. Theorem on Sum of Odd Integers. For all n € Z>1, S "(2k —1)=n’. 

k=1 
Proof. We use induction on n. Here, P(n) is the summation formula Xp; (2k — 1) = n?. 
Base Case. We prove P(1), which says Si (Qk — 1) = 1°. [To prove this, we use the initial 
condition in the definition of summation notation, together with a chain proof.] We know 
ype (2k — 1) = 2-1-1=2-1=1=1?. 
Induction Step. Let n be a fixed positive integer. Assume P(n), i.e., assume >; (2k—1) = 
n?. Prove P(n +1), i.e., prove 3777} (2k — 1) = (n +1)”. [We now use the recursive part of 
the definition of summation notation, and another chain proof.] We know 


II 


n+1 n 
S > (2k — 1) Ses J t2(n+1)-1 (by definition of sums) 


k=1 k=1 
=n? +2(n+1)—1 (by induction hypothesis) 
=n? +2n+1=(n+1) (by algebra). 


This chain of known equations proves the target equation P(n + 1). 


The following diagram offers a visual justification of the formula Xp; (2k — 1) = n?. 


A square of area 5? has been decomposed into L-shaped pieces of areas 1, 3, 5, 7, and 9; 
hence, 1+3+5+7+9=25 = 5. 


Although the diagram provides compelling visual intuition for why the summation for- 
mula is true, it does not prove the formula for all positive integers n. At best, by considering 
the full diagram and the smaller squares inside it, the diagram proves the first five instances 
of the summation formula. 
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4.8. Theorem on Sum of Squares. For all n € Z>1, 5 k? = Diis Hen a an 

k=1 
Proof. We use induction on n. Here, P(n) is the statement X}; k? = n(n + 1)(2n + 1)/6. 
Base Case. We prove P(1), which says Sei k? = 1(1+1)(2-1+1)/6. [To prove this, we use 
the initial condition in the definition of summation notation, together with a chain proof.] 
We know X`}; k? = 1? = 1 =1(2)(3)/6 = 1(1 + 1)(2:1+1)/6. 
Induction Step. Let n be a fixed positive integer. Assume P(n), i.e., assume 


2 n(n+1)(2n+1) 
De- 7 


Prove P(n+1), i.e., prove Se k? = (n+1)([n+1)+1)(2[n +1] +1)/6. [We now use the 
recursive part of the definition of summation notation and another chain proof.] We know 


n+1 
Se- els 
k=1 


2 

z n(n + Hen +1) i i 1) (by induction hypothesis) 
41)(n(2n +1 1 

m (n+1)(n( “_ ) + 6(n + 1)) (by factoring n + 1 out) 
+ 1) (2n? 

= (n + 1)(2n* + 7n + 6) (by algebra) 


(n+ 1)(n + 2)(2n + 3) 


(n+ 1)? (by definition of sums) 


(by factoring) 


6 
5 
_ (n+1)(In +] eer +1]+1) (by algebra). 


This chain of known equalities proves the target formula P(n + 1). 


Section Summary 


1. Recursive Definitions. A recursive definition introduces a sequence of quantities 
f(0), f(1), ..., f(n), ... by defining f(0) explicitly and giving a formula for 
f(n + 1) involving the previous terms f(0), f(1),..., f(n). 


2. Sums, Products, Powers, and Factorials. Given ax E€ R and n € Z>1, recursively 


define 
1 n+1 n 
k=l k=1 k=1 
1 n+1 n 
[aaa Tee (IT) a 
k=1 k=1 k=1 
For x € R and n € Zso, define x° = 1, x"t! = g” . x, O! = 1, and (n + 1)! = 
(n+1)-n 


3. Induction Axiom. For any open sentence P(n), the statement “Vn € Z>1, P(n)” 
is equivalent to the statement “P(1) A Yn € Zsi, (P(n) > P(n+1)).” 


4. Ordinary Induction Proof Template. To prove “Vn € Zs, P(n)” by induction: 
Say “We use induction on n.” 
Step 1. Prove P(1) [the base case]. 
Step 2a. Fix n € Zs. Assume P(n) is true [the induction hypothesis]. 
Step 2b. Aided by this assumption, prove P(n + 1) is true [the induction step]. 
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Exercises 
1. Compute each quantity by expanding a recursive definition. 
4 4 5 
a) 6l (b) 2 (o) 5 (a) 0° (e) Xak ®© Mak? (8) a5 
h) Ii- 9 (2% ale 1). 
2. (a) Prove by induction: for all c € R and all n € Z>1, p1 c= CN. 
b) Find the value of S779 & 
3. (a) Prove by induction: for all n € Z>1, X p-1 k = ne + 1)/2. 
b) Prove by induction: for all n € Z>1, X p4 kK? = n? (n + 1)?/4. 
Thus the sum of the first n cubes is the square of the sum of the first n positive 
integers.] 
4. Prove by induction: for all n € Z>1, Pp; 2:3" = 3"t! — 3. 
5. Prove by induction: for all n € Z>1, Ņ p1 K2" = (n — 1)2”+! + 2. 
6. Prove: for all n € Z>1, Xz- k- k! = (n+1)!— 1. 
7. Prove: for all n € Z>1, [[k-1 (1 + (3/k)) = (n + 1)(n + 2)(n + 3)/6. 
8. (a) Find Xp REDD for 1 < n < 5. (b) Based on your answers to (a), guess the 
value of this sum for general n. Then prove your guess by induction. 
9. Use induction and the recursive deficiton to prove: 
(a) for all a € R and all n € Z>1, a” = [ [k a. 
(b) for all n € Z>1, n! = = [[;j- ide 

10. Find the error in the following proposed proof that for all positive integers n, 
n = n + 3. Proof by induction. Fix an arbitrary positive integer n; assume 
n = n + 3; proven+1 = (n+1)+3. Add 1 to both sides of the induction 
hypothesis to get n+ 1 = (n+3)-+1. By algebra, this becomes n+1 = (n+1)+4+3, 
completing the induction step. 

11. Identify the logical error in the following proposed proof template for an induction 
proof of Vn € Zs, P(n). Explain exactly how the proof template in the section 
differs from the one here. 

Part 1. Prove P(1) is true. 
Part 2. Assume P(n) is true for all positive integers n. Prove P(n + 1) is true for 
all positive integers n. 

12. (a) Prove: for all real a,b,c,d, (a+b) + (c + d) = (a + c) + (b + d). Justify every 
step using one of the axioms from §2.1. 

(b) Additivity of Sums. Suppose ax and bp are given r real numbers isa each positive 

integer k. Prove by induction: for all n € Z>1, S~ ak + bk) -Yar + Yh 
k=1 

Explain how part (a) is used in the proof. 

13. Let c and a, (for each positive integer k) be given real numbers. Prove by induc- 
tion: for all n € Z>1, x (cak) =c- Do 

k=1 

14. Without invoking the Induction S prove that statement (a) in the Induction 
Axiom implies statement (b). 

15. (a) Use the Induction Axiom to prove that the statement “Vn € Zs, P(n)” is 


equivalent to “P(1) A Ym € Z>1, P(m — 1) => P(m).” (b) Create a variation of 
the ordinary induction proof template based on the equivalence in part (a). 
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16. Use intuitive reasoning (such as a picture of dominos) to create a variation of 
the Induction Axiom leading to a method for proving a statement of the form 
Yn € Zeo, P(n). Make a proof template for your proof method. 


17. Let A; be a fixed set for each k € Zyo. (a) Formulate recursive definitions for 
Ug- Ax and Mp- Ak similar to the recursive definitions of sums and products. 
(b) Let In = {j € Z:1< j < n} for each positive integer n. Prove by induction: 
for all n € Z>1, Uki Åk = Ujer Aj and RES Ak = Ayer, Aj. 

18. We say that a set S C R has a least element iff dz € S,Vx E€ Siz < x. 
(a) Prove by induction on n: for all S C Zs, if SN {1,2,...,n} #0, then S has 
a least element. [You can use the fact that for all n € Zs , there is no integer k 
with n < k < n + 1.] (b) Prove that every nonempty subset of Zs has a least 
element. 
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4.2 Induction Starting Anywhere and Backwards Induction 


This section introduces some variations of the ordinary induction proof template that allow 
us to apply induction to more general situations. In particular, induction starting anywhere 
gives us a way to prove statements of the form Vn € Z>», P(n). Backwards induction lets 
us prove statements of the form Vn € Z<, P(n). Finally, we consider various methods 
for proving a universal statement Vn € Z, P(n) where n ranges over all integers (positive, 
negative, and zero). 


Induction Starting Anywhere 


Sometimes, we need to prove universal statements of the form Vn € Zso, P(n), which start 
at n = 0 rather than n = 1. The Induction Axiom and Induction Proof Template are readily 
adapted to this situation. The only change is that in the base case, we prove P(0) instead 
of P(1); and in the induction step, we fix n in the set Zo rather than fixing n in the set 
Z> . More generally, we have the following proof template for induction proofs starting 
anywhere. 


4.9. Template for Induction Proofs Starting Anywhere. Let b be a fixed integer. 
To prove Vn € Zs, P(n): 

Say “We prove the statement by induction on n.” 

Part 1 (Base Case). Prove P(b). 

Part 2 (Induction Step). Fix an arbitrary integer no E€ Z>». Assume P(g) is true. 
Prove P(no + 1) is true. 


Exercise 16 outlines how this proof template may be justified using the Induction Axiom. 


4.10. Example. Prove: for all n € Zso, 5 ga 2+], 

i=0 
Proof. We use induction on n. 
Base Case. We prove >So 2* = 2°+! — 1, We know Jf 2' = 2 =1=2-—1= 2+ _], 
Induction Step. Fix n € Zso. Assume X`% o 2 = 2+1 — 1. Prove Ji 2f = 2049+ 1, 
[Use a chain proof.] We know 


n+1 n 

5 2 — (È ”) + 2rt1 (by definition of sums) 

i=0 i=0 
=2"+1 LR (by induction hypothesis) 
“P (by algebra) 
= 2+1)+1 _ 1 (by definition of exponents). 


All of our examples so far have used induction to prove summation formulas. The next 
examples show how induction can be used to prove other kinds of statements. 


n 
4.11. Example. Prove: Vn € Z>3, II (1 =) = 2 : 
aes k n(n — 1) 
Proof. We use induction on n. 
Base Case. We prove Maa -—2)= 3 T By the initial condition in the recursive defi- 
nition of products and by arithmetic, we know 


ž ( ) 2 1 2 2 
Il 1 1 . 
jt k 3 3 6 36-1 
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2 


Induction Step. Fix an arbitrary integer n € Zs3. Assume [];_3(1 — 2) = WG): 


Prove [[pt3(1- 2) = . We know 


ii(-2)- 


2 
(n+1)(n+1-1) 


(by definition of products) 


l 
Eog 
3 
= 
GOON 
= 
| 
ead Ne) 
Se 
ee | 
, EON 
= 
| 
3 
+ 
— 
Ny 


k=3 
2 1 3 (by induction hypothesis) 
= induction is 
n(n — 1) n+1 eae me 
2 n—-1 
= by algeb 
n(n—1) n+1 (Barbra 
2 2 


Cea Ee (by algebra): 
4.12. Example. Prove: for all integers n > 4, n! > 2”. 
Proof. We use induction on n. [Here, P(n) is the statement “n! > 2”.”] 
Base Case. We must prove 4! > 24. By the recursive definitions, we know 4! = 24 > 16 = 24. 
Induction Step. Fix an arbitrary integer n > 4; assume n! > 2”; prove (n+ 1)! > 2”+1, [We 
use a chain proof for inequalities.] We know 


(n+1)!=(n+1)n! (by definition of factorials) 
> (n+ 1)2” (by induction hypothesis; note n + 1 > 0) 
>2-2” (since n +1 > 2 and 2” > 0) 
ae (by definition of exponents). 


[To explain the two inequalities in more detail, recall this theorem: Va € R, Vb € R,Vc € R, 
if a > band c > 0, then ca > cb and ac > be. The first inequality in the chain follows from 
this theorem by taking a = n!, b = 2”, and c = n + 1; note that the hypothesis a > b is our 
assumption in the induction proof, and c > 0 since n + 1 must be at least 5. Similarly, the 
second inequality follows from this theorem by taking a = n+ 1, b = 2 and c = 2”, which 
is allowed since we know n+ 1 > 2 and 2” > 0. Finally, by transitivity of >, the displayed 
chain of steps lets us deduce that (n + 1)! > 2”*!, which is the goal of the induction step.] 


The next example shows why we started the induction at n = 4 in the last proof. 


4.13. Example. Disprove: for all integers n > 1, n! > 2”. 

Disproof. We must prove In € Zs 1,n! < 2”. [Induction is not required here; we prove the 
existence statement by giving a single example.] Choose n = 3, which is in Z>1. We compute 
= 6 <8 = 2, 


More Examples of Induction Proofs 


4.14. Example. Prove that for all n € Z>o, 8 divides 5°” — 1. 

Proof. We use induction on n. [Here, P(n) is the statement “8 divides 5?” — 1.”] 

Base Case. We must prove 8 divides 57° — 1, i.e., that 8 divides 0. We must prove Jk € 
Z,0 = 8k. Choose k = 0; we know 0 € Zand 0=8-0. 

Induction Step. Fix n € Zo; assume 8 divides 5?” — 1; prove 8 divides 52("+) —1. We know 
Ja € Z,5?" — 1 = 8a; we must prove 3b € Z,5?"+? — 1 = 8b. [How can we choose b? The 
key is to rewrite 57"? in a way that lets us use the induction hypothesis.] On one hand, 
we know 57"+? — 1 = 5?” .5.5 — 1 = 25 . 52”? — 1. Now, from the induction hypothesis, we 
know 5?” = 8a + 1. Substituting, we get 5?”+? — 1 = 25(8a + 1) — 1 = 25(8a) + 25 — 1 = 
8(25a) + 24 = 8(25a + 3). Choose b = 25a + 3, which is an integer by closure. By the above 
algebra, we have 52°+2 — 1 = 8b, as needed. 
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The next example introduces the idea of a recursively defined sequence. The example 
also illustrates how we can gather data to guess a theorem that might be true, and then 
use induction to prove that theorem. 


4.15. Example. Define numbers a1, a2,...,@n,... recursively by setting aj = 5 and an41 = 
3a, — 4 for all integers n > 1. (a) Calculate a1, a2, a3, a4, a5. (b) Use the data in (a) to guess 
a non-recursive formula for an. (c) Prove the formula in (b) by induction. 

Solution. (a) We know a; = 5. Taking n = 1 in the recursive formula, we find az = 3a; — 4 = 
3(5) —4 = 11. Taking n = 2 in the recursive formula, we get a3 = 3a2 — 4 = 3(11) — 4 = 29. 
Continuing similarly, a4 = 3a3 — 4 = 3(29) — 4 = 83, and as = 3a4 — 4 = 3(83) — 4 = 245. 
(b) What is the pattern in the numbers 5, 11, 29, 83,245? We might eventually notice that 
these numbers are close to powers of 3, which are 3, 9,27, 81,243. In fact, we see that a, = 
3” + 2 for 1 < n < 5. We can verify that this formula continues to work for n = 6,7,8,... 
by calculating more values from the recursion. So we guess that an = 3" + 2 for all integers 
n > 1. (Notice that we cannot fully confirm this guess by calculating finitely many an from 
the recursion.) 

(c) We now prove Vn € Zs1,a, = 3” + 2 by induction on n. For the base case, we must 
prove a; = 3!+2. We know a; = 5 and 3!+2 = 5, so the base case is true. For the induction 
step, fix an arbitrary integer n € Z>1, assume an = 3" + 2, and prove an41 = 3"t' +2. We 
have the chain of known equalities: 


An+41 = 3an — 4 (by recursive definition of an+1) 
= 3(8" +2) -4 (by induction hypothesis) 
= 3(38") +3-2-4 (by the distributive law) 
=3"+1 42 (by definition of exponents). 


The goal of the next example is to prove one of the laws of exponents from algebra. 
This statement contains multiple universal quantifiers; we prove it by combining a generic- 
element proof with an induction proof starting at 0. 


4.16. Example. Prove: for all real numbers x and all integers m,n € Z>0, e™t" = a™a", 
Proof. We must prove Vz € R,Ym € Z>0,Yn € Z>0,£™ t” = x™ x". [We begin by using the 
generic-element proof template twice.] Let x be a fixed, but arbitrary real number. Let m 
be a fixed, but arbitrary nonnegative integer. |For the universal statement involving n, we 
use induction.] We prove by induction on n that ®t” = g™g”., 

Base Case. For n = 0, we must prove «+9 = 2x9. On one hand, 2+ = g™ since 
m+0 = m. On the other hand, "2° = 2”-1 = x™ by the initial condition in the 
definition of exponents and the multiplicative identity axiom. So the base case holds. 
Induction Step. Fix an arbitrary integer n > 0. Assume 2+” = g™g”, Prove e™+("+)) = 
c™e"*!, We know 


getn) — a(mtn)+1 (by associativity of addition 


) 
Sie (by recursive definition of exponents) 
= (a x” )a (by induction hypothesis) 
= g™(x" x) (by associativity of multiplication) 
=ar (by recursive definition of exponents). 


Backwards Induction and Induction on Z (Optional) 


So far, we have talked about induction proof templates for proving statements of the form 
Yn € Z>, P(n). What if we need to prove that P(n) holds for all integers n? We discuss 
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two ways to do this in a moment. First, we introduce backwards induction, which can be 
used to prove statements of the form Vn € Z<y, P(n). 


4.17. Template for Backwards Induction Proofs. Let b be a fixed integer. 

To prove Vn € Z<p, P(n): 

Say “We prove the statement by backwards induction on n.” 

Part 1 (Base Case). Prove P(b). 

Part 2 (Induction Step). Fix an arbitrary integer no E€ Z<». Assume P(g) is true. 
Prove P(no — 1) is true. 


See the exercises for intuitive and formal justifications of why this proof template works. 
With this template in hand, we can now describe the first method for proving a universal 
statement about all integers. 


4.18. First Template for Proving Vn € Z, P(n). 

Part 1 (Base Case). Prove P(0). 

Part 2 (Forward Induction Step). Fix an arbitrary integer no € Z>o. Assume P(no) is 
true. Prove P(no + 1) is true. 

Part 3 (Backward Induction Step). Fix an arbitrary integer mo E€ Z<o. Assume P(mo) is 
true. Prove P(mọ — 1) is true. 


This template works because “Vn € Z, P(n)” is logically equivalent to “Vn € Z>0, P(n)A 
Ym € Z<o, P(m).” We prove the latter statement by combining forwards induction (starting 
at 0) with backwards induction (also starting at 0). We could replace each occurrence of 0, 
here and in the proof template above, by any fixed integer b. 

Our second method for proving Yn € Z, P(n) first handles nonnegative integers by 
ordinary induction, and then proves the statement for each negative integer by reducing to 
the positive case dealt with earlier. This technique is useful when the statement P(—n) is 
closely related to the statement P(n). 


4.19. Second Template for Proving Vn € Z, P(n). 

Part 1 (Base Case). Prove P(0). 

Part 2 (Forward Induction Step). Fix an integer no E€ Z>ọo. Assume P(no) is true. Prove 
P(no + 1) is true. 

Part 3 (Proof for Negative Integers). Fix a positive integer no € Z>o. We have proved 
P(no) is true. Prove P(—no) is true. 


Refer to the exercises for a justification of this proof template. 


4.20. Example. We outline a proof that for all integers n, n is even or n is odd, asking 
the reader to supply the missing details. 

Part 1. Prove 0 is even or 0 is odd. Since 0 = 2-0, 0 is even, so the OR-statement is true. 
Part 2. Fix an integer n > 0. Assume n is even or n is odd. Prove n + 1 is even or n+ 1 is 
odd. This part is completed by considering two cases (see the exercises). 

Part 3. Now fix an integer n > 0. By part 2, we already know that n is even or n is odd. 
We must prove —n is even or —n is odd. Again this part can be completed by considering 
two cases. [The fastest way to finish is to use the previously proved results that n is even 
iff —n is even, and n is odd iff —n is odd.] 


Combining this example with Example 2.35 (where we showed that no integer is both 
even and odd), we have finally proved the theorem “every integer is even or odd, but not 
both.” This theorem was announced without proof in §2.3. We give another proof of this 
theorem a little later, using the Integer Division Theorem. 
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Section Summary 


1. Induction Starting Anywhere. To prove Yn € Z>», P(n) by ordinary induction: 
Step 1. Prove P(b) [the base case]. 
Step 2a. Fix n € Z>», and assume P(n) is true [the induction hypothesis]. 
Step 2b. Aided by this assumption, prove P(n + 1) is true [the induction step]. 


2. Backwards Induction. To prove Yn € Z<», P(n) by backwards induction: 
Step 1. Prove P(b) [the base case]. 
Step 2a. Fix n € Z<», and assume P(n) is true [the induction hypothesis]. 
Step 2b. Aided by this assumption, prove P(n — 1) is true [the induction step]. 


3. First Method to Prove Yn € Z, P(n). 
Step 1. [Base case] Prove P(0). 
Step 2. [Forward induction step] Fix n € Z>0; assume P(n); prove P(n + 1). 
Step 3. [Backward induction step] Fix m € Z<o; assume P(m); prove P(m — 1). 


4. Second Method to Prove Vn € Z, P(n). 
Step 1. [Base case] Prove P(0). 
Step 2. [Forward induction step] Fix n € Z>0; assume P(n); prove P(n + 1). 
Step 3. [Negative case] Fix n € Zs; note P(n) has been proved; prove P(—n). 


Exercises 


- 1 1 
1. Prove by induction: for all n € Z>2, II (1 — =) sal : 
k=2 


2. (a) Compute the product II (1 — z) for 4 < n < 7. (b) Based on the data in 


(a), guess the value of this product for general n > 4, and prove your guess by 
induction. 


3. (a) Prove by induction: for all real r Æ 1 and all n € Z>ọ, 


(b) Compute $ ;_or® when r = 1. 

4. Prove: Vz € Rso,Vn € Z>0, (14+ 2)" > 1+nze. 

5. (a) Prove: for all n € Zs5, n? < 2”. [To finish, you may need to prove a similar 
inequality involving a smaller power of n.] (b) Disprove: for all n € Z>1, n? < 2”. 

6. Prove by induction: for all n € Z>ọ, 6 divides 7” — 1. 

7. Telescoping Sums. For each k € Z, let ap be a fixed real number. Prove: for all 
i,j €Z, ifi < j, then $7 _;(ak+41 — ak) = aj41 — Gi. 

8. Laws of Exponents. Prove: Vz € R,V¥m E Zso,Vn € Zso,(x™)” = 2”. [Fix x 
and m and use induction on n. You will need the law of exponents proved in 
Example 4.16.] 

9. Prove: Vz € R,Vy € R, Yn E€ Zso, (zy)? = a"y”. 

10. Define ap = 1 and an+1 = 2an — 1 for all n € Z>o. (a) Compute an for 0 <n < 5. 
(b) Based on the data in (a), guess a general formula for a, and prove it by 
induction. 
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11. 


12. 


13. 


14. 
15. 


16. 


17. 


18. 


19; 


20. 


21. 


22. 


Define ao = 4 and a@n41 = 2an + 3- 2” for all n € Z>o. Prove: 
Vn E Zs, an = (3n + 4)2”. 


n 


Let x and y be fixed, unequal real numbers. (a) Simplify ae forl<n<4. 
oe 


(b) For general n € Z51, guess a summation formula that evaluates to 
and prove it by induction on n. 


i : Z 4 (+15 
(a) Prove by induction: for all n € Z>1, Nog < Se 
j=1 
(b) Prove the inequality in (a) using integral calculus. 
Fill in the missing details of the proof in Example 4.20. 


Negative Exponents. The goal of this problem is to prove the laws of exponents 
where the powers involved can be any integers (possibly negative). Given any 
nonzero real number z, the axioms for R tell us that there exists a unique real 
number 2x’ (the multiplicative inverse of x) such that x- a’ = z' -x = 1. We 
have already defined x” when n is a nonnegative integer. For a positive integer 
n, we now define the negative power z~” to be (x')”. In particular, notice that 
x! = (x)! = 2’ is the multiplicative inverse of x. 

(a) Prove: Vz € Ryo, Vn € Zso, (x’)” = (2”)’. [So (a71)" = (a")“1] 

(b) Prove: Vz € Ryo, Vm € Z, Yn € Z,x™t" = g™q”. 

(c) Prove: Vz € Ryo, Vm E Z, Yn € Z, (x™)” = a™. 

(d) Prove: Vz € Ryo, Vy € Ryo, Vn E Z, (xy)” = x”y”. 


Proof Template 4.9 gives a method to prove statements of the form 

Vn € Z>», P(n) by induction. Use the Induction Axiom 4.5 to justify this proof 
template. [Hint: Given P(n) and b, let Q(n) be the statement P(b + n — 1) for 
each positive integer n.] 


Give an intuitive justification for the Backwards Induction Proof Template 4.17 
by imitating our informal explanation of the Induction Axiom, based on repeated 
use of the Inference Rules for IF and ALL. Illustrate your answer by drawing a 
row of dominos. 


Use induction starting anywhere (see Proof Template 4.9) to give a formal justi- 
fication of backwards induction (see Proof Template 4.17). 


It is known that for all m € Z, either m > 0 or —m > 0. Use this theorem to give 
a formal justification of Proof Template 4.19 for proving statements of the form 
Yn € Z, P(n). 

Let P(n) be a fixed open sentence. Suppose we have proved 

‘Yn € Z, P(n) > P(n+1)” and “Vn € Z, P(n) > P(n — 1)” 

(a) Give a specific example to show that Yn € Z, P(n) can be false. 

(b) Prove, however, that “Yn € Z, P(n) or Ym € Z, ~ P(m)” must be true. 


Induction Stepping by 2. Here is a new template for proving Yn € Zso, P(n): 
Step 1. Prove P(0). Prove P(1). 

Step 2. Fix no € Zso. Assume P(no) is true. Prove P(no + 2) is true. 

Show that this template is a correct method for proving the given statement. 
[Hint: Let Q(n) be the statement “P(n) and P(n + 1).” Prove Vn € Z>0,Q(n) 
by ordinary induction.] 


Define ao = 1, a, = 4, and an = 9an—2 — 8 for all n € Z>2. Guess a non-recursive 
formula for an, and then prove it using the template in Exercise 21. 
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23. Prove: for every odd integer n, 8 divides n? — 1. [Suggestion: Write n = 2k +1 
and use induction on k.| 


24. Let s > 1 be a fixed integer (the step size). State and prove a generalization of 
Exercise 21 where part 2 of the template fixes no € Z>0, assumes P(no) is true, 
and proves P(ng + s) is true. 
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4.3 Strong Induction 


In an ordinary induction proof of Yn € Z>1, P(n), we are allowed to assume P(n) in the 
induction step (where n is fixed) to help us prove P(n + 1). This method works well in 
many situations, but sometimes the assumption is not powerful enough to enable us to 
finish proving P(n + 1). Strong induction lets us assume even more information as part 
of the induction hypothesis. Instead of assuming only P(n) when proving P(n + 1), we 
are now allowed to assume that all of the preceding statements P(1), P(2), ..., P(n) are 
true. This additional information gives us more to work with as we try to prove P(n + 1). 
Strong induction is often needed to handle recursive definitions where the recursively defined 
quantity f(n + 1) depends on several of the preceding objects f(1), f(2),..., f(n). 


The Strong Induction Proof Template 


We begin with an example of an attempted proof by ordinary induction where the induction 
hypothesis does not give us enough information to finish the proof. 


4.21. Example. Define a sequence recursively by setting Fo = 0, Fı = 1, and Fn = 
Fy-1 + Fn-2 for all integers n > 2. (This is the famous Fibonacci sequence.) 

Prove: Vn E€ Z>0, Fn < 2”. 

Proof. We try a proof by induction on n. For the base case, we must prove Fy < 2°. Since 
Fo = 0 and 2° = 1 and 0 < 1, the base case is proved. Now fix an arbitrary integer n > 0, 
assume F, < 2”, and try to prove F,41 < 2”+!. On one hand, if n = 0, we are proving 
F, < 2', which holds since F} = 1 and 2' = 2. On the other hand, if n > 0, then n+1 > 2, so 
the recursive definition of F,,41 tells us that F,,41 = Fn + F,_1. Here is where the ordinary 
induction proof gets stuck. We can use the induction hypothesis to bound the first term on 
the right side by 2”. Similarly, we would like to bound the second term on the right side by 
writing Fa—ı < 2-1. Unfortunately, our induction proof (as we have currently set it up) 
has not provided us with any assumptions about F,-1. So the proof by induction cannot 
be completed. 


In the previous example, we could have finished the proof if we had been allowed to 
assume that P(n — 1) and P(n) were both true, before trying to prove P(n + 1). We 
now present a variation of induction, called strong induction or complete induction, that 
allows us to assume even more information as part of the induction hypothesis. These extra 
assumptions can make it easier to complete the induction step. 


4.22. Strong Induction Theorem. Let P(n) be any open sentence, and let b be a fixed 
integer. To prove Vn € Z>», P(n), we may instead prove this statement: 


For all integers n > b, if (for all integers m such that b < m < n, P(m) is true), 
then P(n) is true. 


We can recast this theorem as the following proof template for a strong induction proof. 


4.23. Strong Induction Proof Template. To prove Yn € Z>», P(n) by strong induction: 
Fix an arbitrary integer no > b. 

Assume: for all integers m in the range b < m < no, P(m) is true. 

Prove P(ng) is true. 
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Five Remarks on Strong Induction 


(1) The Strong Induction Theorem can be proved from the (ordinary) Induction Axiom; we 
give a proof in an optional section later. 
(2) Here is intuition for why strong induction works. Intuitively, any induction proof 


starting at b proves the statements P(b), P(b+ 1), P(b+ 2), ..., in this order. Let no > b 
be a fixed integer. When the proof reaches P(ng) and tries to prove it, we will have already 
successfully proved all preceding statements P(b), P(b+1),..., P(no — 1). So we might as 


well assume all those statements are true when attempting to prove P(no). This is exactly 
the assumption in the proof template above; we call this assumption the (strong) induction 
hypothesis. The word “strong” (or “complete”) refers to the fact that we are allowed to 
assume a stronger statement in this proof template compared to the ordinary induction 
proof template. 

This remark can be informally visualized using the domino analogy. 


Here P(n) is the statement “domino n falls down.” Suppose we are proving that all dominos 
fall down by induction. By the time we reach domino 6, we already know that the first 5 
dominos have fallen down. So it is allowable to assume P(1) A P(2) A P(3) A P(4) A P(5), 
not just P(5), to help us prove P(6). 

(3) Now that strong induction is available, we could always use this proof technique 
instead of ordinary induction. However, ordinary induction proofs are conceptually simpler 
than strong induction proofs, so many people prefer ordinary induction for problems where 
this method succeeds. As a general rule, ordinary induction is applicable to recursive defi- 
nitions where f(n + 1) depends only on f(n); whereas strong induction is needed for more 
complicated recursive definitions. 

(4) The strong induction proof template does not specifically mention a base case, unlike 
the ordinary induction proof template. However, there is a hidden base case in strong 
induction. Consider the situation where the fixed integer no happens to equal b. Then there 
are no integers m satisfying b < m < no, and so our strong induction hypothesis is not 
actually assuming anything for this value of no. It follows that we need to prove P(b) 
without any prior knowledge, which is exactly what we must do in the base case of an 
ordinary induction proof. 

(5) In strong induction proofs, it often happens that several small values of no (not just 
no = b) need to be treated in separate cases, which could be called the “base cases” of the 
strong induction proof. These special cases often correspond to the initial conditions in a 
recursive definition, as illustrated in the following examples. 


Examples of Strong Induction Proofs 
We begin by revisiting the failed proof above. 


4.24. Example. Define Fo = 0, Fı = 1, and Fn = Fn-1 + Fn-2 for all integers n > 2. 
Prove: for all integers n > 0, Fn < 2”. 
Proof. We use strong induction on n. Fix an integer n > 0. Assume that for all integers m 
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in the range 0 < m < n, Fm < 2. Prove Fn < 2”. We know n = 0 or n = 1 or n > 2, so 
use cases. [The previous statement was motivated by the recursive definition of Fp, which 
has different cases for n = 0, n = 1, and n > 2.] 

Case 0. Assume n = 0; prove Fy < 2°. We know Fp = 0 and 2° = 1 and 0 < 1, so Fp < 2°. 
Case 1. Assume n = 1; prove F, < 21. We know F; = 1 and 2t = 2 and 1 < 2, so F} < 2!. 
Case 2. Assume n > 2; prove Fa < 2”. To start, we use the recursive definition to write 
Fy = Fn-1 + Fn-2. Now, 0 < n— 1 < n, so the strong induction hypothesis (taking 
m =n-—1) tells us Fp—ı < 2”~!. Furthermore 0 < n— 2 < n, so we also know F,_2 < 2"~? 
by taking m = n — 2 in the induction hypothesis. Hence, 


Fy = Fy + Pye < 271 4+ 2 * = 2" (2-1 + 2-4) = 27 (3/4) < 2”, 
completing the proof of this case. The induction proof is now finished. 


4.25. Example. Define ap = 3, a, = 8, and an = 4(an_1 — an-—2) for all integers n > 2. 
Prove: for all integers n > 0, an = (n+ 3)2”. 

Proof. We use strong induction on n. Fix an arbitrary integer n > 0. Assume that for all m 
in the range 0 < M < Nn, Gm = (m + 3)2™. Prove an = (n + 3)2”. We know n =0orn=1 
or n > 2, so use cases. [This step was suggested by the cases in the recursive definition of 
an.] 

Case 0. Assume n = 0; prove ao = (0 + 3)2°. We know ap = 3 = (0 + 3)2°, so this case is 
proved. 
Case 1. Assume n = 1; prove a; = (1 +3)21. We know a; = 8 = 4-2 = (1 +3)2!, so this 
case is proved. 

Case 2. Assume n > 2; prove ad, = (n + 3)2”. In this case, we have 0 < n— 1 < n and 
0 < n—2 < n, so the induction hypothesis tells us that an-ı = (n — 1 + 3)2”71 and 
an—2 = (n — 2 + 3)2”-?. Using the recursive definition and this information, we deduce 


Qn = 4(an—1 — Gn—2) = 4((n + 2)2"~-* — (n+1)2”-?) = 2"(2(n + 2) — (n +1)) = 2” (n +3). 
So the result holds in all cases, completing the induction. 


4.26. Example. Define ap = 1 and a, = ( = ak) + 1 for all n > 1. Guess an explicit 
(non-recursive) formula for an and then prove it. 

Solution. We use the recursive definition to compute ag = 1, a1 = ao +1 = 1+1 = 2, 
a2 =a ta,¢1=—=14+24+1= 4, a3 = ao + @ı +a2 +1 =1+4+2+4+1 = 8, a4, = 
ao + aı +a2 +a3 +1 =1+2+4+8+1 = 16, and so on. It looks like a, = 2” for all 
integers n > 0. Since the recursive formula for an involves all preceding values az, we prove 
our guess by strong induction on n. Fix an arbitrary integer n > 0. Assume: for all integers 
k in the range 0 < k < n, ak = 2%. Prove an = 2”. [The definition of an breaks into two 
parts, suggesting the following two cases.] We know n = 0 or n > 0, so use cases. 

Case 1. Assume n = 0; prove ag = 2°. We know ag = 1 = 2°. 

Case 2. Assume n > 0; prove an = 2”. In this case, we know an = ( pee ay) +1. Each term 
ak appearing in the sum has an index k in the range 0 < k < n. So the strong induction 
hypothesis allows us to replace every term a; by 2” in the sum defining an. We now know 
An = ( 2, 2")+41. By a previous theorem (proved in Example 4.10 by ordinary induction), 
the sum appearing here evaluates to 2”71+1 — 1 = 2” — 1. So ap = (2? — 1) +1 = 2”, 
completing the induction proof. 


4.27. Example. Prove: Vn € Z>18, Ja E€ Z>0,3b € Zs0,n = Ta + 4b. [This statement can 
be concretely interpreted as follows: for all n > 18, we can pay n cents postage using a 
combination of 7-cent stamps and 4-cent stamps.] 

Proof. We use strong induction on n. Fix an integer n > 18. Assume: for all integers m in 
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the range 18 < m < n, de € Z>0, dd E€ Z50,m = 7c + 4d. Prove: Ja € Z>0,3b E€ Zso,n = 
7a + 4b. [Note the use of non-conflicting letters for the existentially quantified variables in 
the induction hypothesis and the goal.] We know n = 18 or n = 19 or n = 20 or n = 21 
or n > 22, so use cases. [This is the tricky part — how did we know to introduce these 
particular cases? The reason will retroactively emerge at the end of the proof] 


Case 1. Assume n = 18. Choose a = 2 and b = 1, which are in Zyo. By arithmetic, 
Ta + 4b = 7(2) + 4(1) = 144+4=18=n. 
Case 2. Assume n = 19. Choose a = 1 and b = 3, which are in Z>ọo. By arithmetic, 
Ta + 4b = 7(1) + 4(3) =74+12=19 =n. 
Case 3. Assume n = 20. Choose a = 0 and b = 5, which are in Z>ọ. By arithmetic, 
Ta + 4b = 7(0) + 4(5) = 0 + 20 = 20 =n. 
Case 4. Assume n = 21. Choose a = 3 and b = 0, which are in Z>ọ. By arithmetic, 
Ta + 4b = 7(3) + 4(0) = 21 + 0 = 21 = n. [Note that the induction hypothesis was not 
needed to complete the first four cases.] 


Case 5. Assume n > 22. The key observation is that for such an n, the integer m = n — 4 
satisfies 18 < m < n. So we can take m in the induction hypothesis to be n— 4. By definition 
of ALL, we know there exist co € Z>o and do € Z>o with n — 4 = Tco + 4do. Adding 4 to 
both sides, we get n = 7co + 4do + 4 = 7co + 4(do + 1). Choose a = co and b = do + 1, which 
are both nonnegative integers by closure (since co and dp are in Zo). Then n = Ta + 4b 
holds, completing the proof of Case 5. [Now, at the end of the proof, we understand why 
the given cases were used. The point is that we can go from a solution for n— 4 to a solution 
for n by adding one more 4-cent stamp. But n — 4 falls outside the allowable range of m’s 
when n is 18, 19, 20, or 21. So these values of n must be treated in separate base cases.] 


Proof of the Strong Induction Theorem (Optional) 


We now prove the Strong Induction Theorem. Fix an integer b and an arbitrary open 
sentence P(n). Assume that we have proved the following statement: 


For all integers n > b, if (for all integers m such that b < m < n, P(m) is true), 
then P(n) is true. 


We must prove the statement Yn € Z>», P(n). The idea is to begin by proving a related 
statement by ordinary induction starting at b. Define a new open sentence Q(n), which is 
the statement “for all integers m in the range b < m < n, P(m) is true.” We will prove 
Yn € Z>., Q(n) by induction on n. 

For the base case, we must prove the statement Q(b). This statement says “for all integers 
m in the range b < m < b, P(m) is true.” There are no integers m satisfying b < m < b, so 
this statement is vacuously true. 

For the induction step, fix an integer n > b, assume Q(n) is true, and prove Q(n + 1) is 
true. We have assumed the following induction hypothesis: 


for all integers m in the range b < m < n, P(m) is true. 
We must prove: 
for all integers m in the range b < m < n +1, P(m) is true. 


Fix an integer mp, and assume b < mp < n + 1. Since mo is an integer, we must have 
b < mo < n or mo = n. In the case where b < mo < n, the induction hypothesis and the 
Inference Rule for ALL tell us that P(mo) is true. In the case where mo = n, the induction 
hypothesis coincides with the hypothesis of the IF-statement that we assumed at the start 
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of the proof. By the Inference Rule for IF, we see that P(n) is true. Because mo = n, P(mo) 


is true in this case as well. 


We have completed the induction proof of the statement Vn € Z>», Q(n). We use this 
to prove Vn € Z>», P(n). Fix an arbitrary integer n > b. Then n + 1 is also an integer in 


Z>», so Q(n + 1) is true. So for all integers m in the range b < m < n +1, P(m) is true. 


Since n is an integer satisfying b < n < n + 1, we can take m = n to conclude that P(n) is 


true, as needed. 


Section Summary 


T; 


Strong Induction. To prove Vn € Z>», P(n) by strong induction: 

Step 1. Fix an arbitrary integer n > b. 

Step 2a. Assume that for all integers m in the range b < m < n, P(m) is true. 
Step 2b. Aided by this assumption, prove P(n) is true. 


Remarks on Induction Proofs. Ordinary induction can be used when the truth 
of P(n +1) can be deduced assuming only that the previous statement P(n) is 
true. Strong induction is needed when the truth of P(n+1) depends on several of 
the preceding statements P(n), P(n—1), .... Both types of induction proofs can 
be justified using the ordinary Induction Axiom. Although the strong induction 
proof template does not specifically mention a base case, we often need to treat 
small values of n as separate cases. The required cases in the proof are often based 
on corresponding cases in the recursive definitions of concepts appearing in the 
statement P(n). 


Exercises 


ils 


Recursively define ag = —1, a; = 1, and an = 8an_1 — 15an_2 for all n € Z>2. 
Prove: for all n € Z>0, an = 2-5" — 3"*1. 

Define a sequence recursively by setting a, = 2, a2 = 1, and an = 2an_1 — Gn_2 
for all integers n > 3. (a) Compute an for 1 < an < 6, and then guess an explicit 
(non-recursive) formula for an. (b) Use strong induction to prove your formula 
for an holds for all integers n > 1. 

Define ap = 4, a, = 32, and a, = 12a,_1 — 20a,_2 for all integers n > 2. Guess a 
non-recursive formula for a, (valid for all n > 0), and prove your guess by strong 
induction. 


Use ordinary induction or strong induction to prove the following summation 
formulas for Fibonacci numbers (see Example 4.21), valid for all integers n > 0. 


(a) Dko Fe = Fato—1. (b) ERIS Forti = Fon. (c) Eko For = Fanti — 1. 
Prove by strong induction: Vn € Z>11, Ja € Zyo, db E€ Zyo0,n = 2a + 5b. 


Consider this statement: Yn € Z>no, Ja € Z>0, 4b € Z>0,n = 12a + 5b. Find the 
smallest value of no for which the statement is true, and then prove it. 


Define ao = 1, a1 = 3, ag = 33, and an = Tan_1 — 8an—2 — l6an-3 for all n € Z>3. 
Prove: for all n € Zso, an = (—1)” + n4”. 


Define ao = 1, a1 = 2, ag = 5, and an = 3an_1 — 8an—2 + Gn—3 for all n € Z>3. 
Guess an explicit formula for an and prove your formula holds for all n € Zso. 
Define bı = 1, bg = 2, b3 = 3, and bp = bn—1 + bn_2 + bn-3 for all n > 4. Prove 
by strong induction: for all n E€ Z>1, bn < 2”. 
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10. Let F„ be the Fibonacci sequence defined in Example 4.21. Find and prove a 
formula for F? — F,,41F,~1, valid for all n € Zs}. 


11. Prove the following exact formula for the Fibonacci numbers Fp, (see Exam- 
ple 4.21): for all n € Zo, Fn = [((1 + V5)” — (1 — V5)"]/(2"V'5). 

12. In Example 4.26, prove an = 2” for all integers n > 0 by ordinary induction (not 
strong induction). 


13. Define co = 1 and cy, = cag ieee Ck for all integers n > 0. Guess an explicit 
formula for cn and then prove it. 


14. Define dọ = 1 and dy41 = pee 2d, for all n € Z>o. Prove: for all integers 
n > 2, dn = [hs OP +1). 


15. Give just the structural outline of a proof by strong induction of this statement: 


Vn € Z>1, Ja € Z>0, 3b € Z>0, 3c € Z>0, Jd € Zon, n= 0 +b +P +R. 


16. (a) Prove that for every integer n > 0, there exists k € Z such that n = 4k or 
n = 4k +1 or n = 4k +2 or n = 4k + 3. (b) Prove that the result in (a) also holds 
for all negative integers n. 


17. Using the theorem that every integer is even or odd, prove by strong induction: 
for all n € Z>1, there exist k € Z>ı and distinct exponents e1, €2,..., €k E Z>0 
such that n = 2° + 2° +... + 2°, 


18. (a) Prove: for all n € Zs3, Fn41 < 2Fn, where Fn is a Fibonacci number (Ex- 
ample 4.21). (b) Use strong induction to prove that every positive integer n can 
be expressed as a sum of one or more distinct Fibonacci numbers. Suggestion: 
Consider the largest Fibonacci number not exceeding n. 


19. Consider the following template for proving Vn € Zso, P(n): 
Step 1. Prove P(0). Prove P(1). 
Step 2. Fix n € Zso. Assume P(n) and P(n + 1). Prove P(n + 2). 
Prove the correctness of this template using ordinary induction (not strong in- 
duction). 


20. (a) Create a strong induction proof template to prove statements of the form 
Yn € Zee, Q(n). (b) Use the Strong Induction Theorem 4.22 to prove that your 
template in (a) works. 


21. Let P(x) be the open sentence “0 < x < 1,” where x is a real variable. (a) Dis- 
prove: Vz € Ro, P(x). (b) Prove: For all z € Rso, if (for all real numbers y in the 
range 0 < y < x, P(y) is true) then P(x) is true. (c) Conclude that the Strong 
Induction Theorem 4.22 is not valid if we replace integers by real numbers. 


22. Suppose we know: “for all n € Z, if (for all integers m < n, P(m) is true), then 
P(n) is true.” Must it follow that “Vn € Z, P(n)” is true? Explain. 


23. Suppose we know that every nonempty subset of Zs, has a least element (see 
Exercise 18 in §4.1). Use this fact to prove the Strong Induction Theorem 4.22, 
taking b = 1. [Suggestion: Let S be the set of positive integers n such that P(n) 
is false; prove S = Q.| 
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4.4 Prime Numbers and Integer Division 


Now that we have strong induction, we can investigate the divisibility properties of the 
positive integers more closely. We begin by defining prime numbers and showing that every 
integer n > 1 can be written as a product of primes. Then we study integer division, which 
is a process for dividing an integer a by a nonzero integer b to produce a unique quotient q 
and remainder r. 


Primes and Prime Factorizations 


A composite number is a positive integer that can be expressed as the product of two smaller 
positive integers, whereas a prime number cannot be written in this way. Formally, we have 
the following definition. 


4.28. Definition: Prime and Composite Integers. 


(a) For alln € Zs1,)n is composite |iff| da € Z,4b E Z,(1<a<n)A(1<b<n)An=ab|. 


(b) For all n € Z1, |n is prime | iff | n is not composite |. 


Note that we give these definitions only for integers n > 1. By convention, 1 is not prime 
and 1 is not composite. Similarly, 0 is neither prime nor composite. It can be proved from 
the above definition that an integer p > 1 is prime iff the only positive divisors of p are 1 
and p. The first few primes are 


2,3, 5,7, 11, 13, 17, 19, 23, 29, 31,37,.... 
Our next theorem states that every integer in Zs; can be factored into primes. 


4.29. Theorem on Existence of Prime Factorizations. Every integer n > 1 can be 
written as a product of one or more primes. More precisely, for all n € Z>1, there exist 
k € Zs, and primes p1, P2,.-., Pk with n = pip2--- pp. (The primes appearing here need 
not be distinct.) 

Proof. We use strong induction on n. Fix an arbitrary integer n € Zs,. Assume: for all 
integers m in the range 1 < m < n, m can be written as a product of one or more primes. 
Prove n can be written as a product of one or more primes. We know n is prime or n is not 
prime, so use cases. 

Case 1. Assume n is prime. Then n is a product of one prime, namely itself. (In other words, 
we can choose k = 1 and pı = n above, to obtain n = pj.) 

Case 2. Assume n is not prime. Then n is composite, so there exist a,b € Z withl <a<n 
and 1 < b < n and n = ab. Since a and b are both larger than 1 and less than n, we can 
apply the induction hypothesis to both a and b to see that a is a product of one or more 
primes, and b is a product of one or more primes. Say a = p,po-:- pp and b = qiq2°-- Gm 
where every p; and q; is prime. Then n = ab = pip2---peqid2::* Gm, which expresses n as 
a product of k + m primes. 


Later, we strengthen this theorem by proving that the prime factorization of n is unique 
in a certain respect. More specifically, if we have written n = pip2--- pr and n = q1q2 -` Gm 
where all p; and q; are prime, then we must have k = m and q,...,q% is a reordering of 
Pi,---,Pr- For now, we use the existence of prime factorizations to prove a famous theorem 
of Euclid about prime numbers. 
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4.30. Theorem on Infinitude of Primes. There are infinitely many prime integers. More 
precisely, for any k € Z> 1 and any finite list of primes p;,po,..., px, there exists a prime q 
different from every pi. 

Proof. Fix k € Z>, and primes p1, p2,...,pr. Consider the integer n = 1 + M Pr. We 
know n is prime or n is not prime. 

Case 1. Assume n is prime. Since n is larger than every p;, n is a prime number distinct 
from all p;. 

Case 2. Assume n is not prime. Then we can write n as a product of primes, say n = 
mq2°::q~ with all qj prime. We prove qı is a prime different from every p;, using proof 
by contradiction. Assume, to get a contradiction, that there exists i with qı = pi. Then 
pi divides n since n = qiq2°-+ qk = pilqo-+: qe). Also p; divides re pr. It follows that p; 
divides 1 = n — U Pr. This is impossible, since p; > 1. This contradiction completes the 
proof of Case 2. 


Integer Division 


In grade school, we learn to divide one integer by another, producing a quotient and a 
remainder. For example, if we divide 100 by 7, the quotient is 14 and the remainder is 
2. One way to present this information is by writing 100/7 = 14 + (2/7). When studying 
integers, it is convenient to avoid fractions by multiplying this equation by the denominator 
7. We obtain the integer equation 100 = 7- 14+ 2. 

In general, we want to be able to divide any nonnegative integer a by any positive integer 
b to produce a quotient q and a remainder r. In other words, given a > 0 and b > 0, we 
seek integers q > 0 and r > 0 satisfying a/b = q + (r/b), or equivalently, a = bq + r. The 
remainder r should be small in the sense that r/b should be less than 1, which holds iff 
O0<r<b. Our next result shows that this division process is always possible. To motivate 
the proof, consider the following example. 


4.31. Example. Let a = 25 and b = 7; find the quotient and remainder for this a and b. 
Solution. The answer can be found by the long division algorithm learned in school. But 
here is a more basic solution relying on the idea that we can perform a division by repeated 
subtraction. Start with 25 and subtract 7 repeatedly. First, 25—7 = 18. Second, 18—7 = 11. 
Third, 11—7 = 4. The current value 4 is less than 7, so we have found the remainder r = 4. 
We subtracted 7 three times to reach the remainder, so the quotient is q = 3. 

The idea of the induction proof below is to recast this iterative computation in a recursive 
way. Specifically, in a single subtraction, we pass from the initial input 25 to 25 — 7 = 18. 
We could recursively calculate the quotient and remainder when 18 is divided by 7 (namely, 
qı = 2 and rı = 4), and then obtain the quotient and remainder for 25 by adding 1 to the 
quotient (giving q = 3) and keeping the old remainder (giving r = 4). The initial condition 
for the recursive calculation occurs when the initial number a is already less than b. Then 
we can take q = 0 and r = a. For example, 4 divided by 7 has quotient 0 and remainder 4. 


Let us see how the intuition in the preceding example translates into a formal proof. 


4.32. Integer Division Theorem (Preliminary Version). For all integers a > 0 and 
b > 0, there exist integers q,r with a= bq +r and0 <r <b. 

Proof. To better understand how the induction proof handles quantifiers, we write the 
statement to be proved in formal symbols as follows: 


Vb € Zo, Va € Z>0, 3q E€ Zar E€ Z,a = bq +r ^A (0 <r <b). 
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Fix b € Z>o. We prove the inner statement by strong induction on a. Fix an arbitrary 
integer a € Z>ọo. Assume: for all integers c in the range 0 < c < a, 


dq € Z, Jrı € Z,c= bq, + rı and 0 < rı < b. 


We must prove: 


dq € Z, Jr € Z,a = bq +r and0 <r <b. 


We know a < b or a > b, so use cases. 

Case 1. Assume a < b. Choose q = 0 and r = a. We know q,r € Zso, a = bq +r (since 
a=b-0+a) and 0 < a < b, so our choice proves the existence statement. 

Case 2. Assume a > b. In the induction hypothesis, let us take c = a — b. This is a legitimate 
value of c, since c = a — b > 0 and c = a — b < a (because b > 0). Now, the induction 
hypothesis gives us integers q1, rı > 0 with c = bq, + rı and 0 < rı < b. Adding b to both 
sides, a = c + b = bq + rı +b = b(qı + 1) +11. So, choosing q = qı + 1 and r = rı (which 
are in Z>o), we have a = bq +r and 0< r <b. 


Integer Division for Negative Integers (Optional) 


We now improve the Integer Division Theorem in two ways. First, we extend the result to 
the case where a and b might be negative integers (although b can never be zero). Second, 
we prove that the quotient and remainder are uniquely determined by a and b. The next 
several lemmas deal with these issues. We begin by allowing the dividend a to be negative; 
in this case, the quotient q might be negative. 


4.33. Lemma: Extension of Integer Division to Negative a. 


Vb € Zyo,| Va € Z| dq € Z, Jr € Z,a = bq +r and0 <r <b. 


Proof. Fix b € Zyo. Fix a € Z. We know a > 0 or a < 0, so use cases. 

Case 1. Assume a > 0. Then the needed result follows from the previous theorem. 

Case 2. Assume a < 0. Then —a > 0, so the previous theorem says there exist q2,r2 € Z 
with —a = bq2 +r and 0 < rp < b. Now, a = —bq2 — r2. [We are tempted to choose q = —q2 
and r = —r2, but this produces a negative remainder most of the time. This suggests the 
following subcases.] We know r2 = 0 or rg > 0, so use cases. 

Case 2a. Assume rg = 0. Then a = b(—q2) + 0, so we can choose q = —q2 and r = 0. 

Case 2b. Assume r2 > 0. Then —b < —rg < 0, so (adding b) 0 < b — r2 < b. So we can 
manipulate a = —bq2 — r2 as follows: we know 


a = —bq2 — r2 = —bq2 — b + b — r2 = b(—q2 — 1) + (b — r2). 

Now choose q = —q2 — 1 € Z and r = b — r2 € Z; observe that 0 < r < b for this choice of r. 
4.34. Example. To illustrate the above proof, first suppose a = —21 and b = 7. Then 
—a = 21 = 7 -3 +0 (so q2 = 3, r2 = 0), and we see that a = —21 = 7- (—3) +0 (so q = —3, 
r = 0). On the other hand, suppose a = —26 and b = 7. Then —a = 26 = 7-3+5 (so q2 = 3, 
ra = 5). Negating gives a = —26 = 7- (—3) — 5, which has a forbidden negative remainder. 
We fix this by writing a = —26 = 7 - (—3)— 7 +7 -—5= 7. (—4) +2, so q = —4 = -qQ — 1 
and r = 2 = b — r2, in accordance with the proof above. 

For the next extension, we allow the divisor b to be negative. We still want nonnegative 
remainders, so we now require r to satisfy 0 < r < |b]. 


4.35. Lemma: Extension of Integer Division to Negative b. 


Vb € Zyo |, Va € Z, 3q € Z, dr € Z,a = bq +r and 0 <r < |b]. 
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Proof. Fix b € Zz9. We know b > 0 or b < 0, so use cases. 

Case 1. Assume b > 0. In this case, we already proved the needed result in the previous 
lemma. 

Case 2. Assume b < 0. Then b = —|b| where |b| € Z>o. Applying the previous results to 
divide a by |b|, we see there exist integers q3,r3 E€ Z>o with 


a = |b|q3 + r3 and 0 < r3 < |b]. 


Now, |b| = —b, so a = (—b)q3 + r3 = b(—q3) + r3. Choose q = —q3 and r3 = r to prove the 
existence assertion in the lemma. 


For example, dividing a = —26 by b = —7 gives quotient q = 4 and remainder r = 2. 


Uniqueness of the Quotient and Remainder 


The final, crucial improvement to our results so far is to prove that the quotient and 
remainder are uniquely determined by a and b. Here is the full-fledged statement of the 
Integer Division Theorem. The proof assumes some basic facts about absolute value are 
known; these facts are discussed in more detail in §8.4. 


4.36. Integer Division Theorem (Final Version). For all integers a and all nonzero 
integers b, there exist unique integers q,r with a = bq +r and 0 < r < Jbl. 

Proof. First we restate the theorem in formal symbols to better understand the uniqueness 
assertion. We must prove: 


Vb € Z420, Va € Z,3! (q,r) EZ x Z,a =bq+r^(0<rx< jbj). 


Fix arbitrary b € Zyzo and a € Z. We have already proved the existence of (q,r) above. 
To prove uniqueness, we use the uniqueness proof template (recall this arises from the 
rule for eliminating the uniqueness symbol). Fix arbitrary (q1,r1) € Z x Z and arbitrary 
(q2,r2) E Z X Z. Assume a = bq, + rı and 0 < rı < |b| and a = bq2 + r2 and 0 < r2 < |b]. 
We must prove (q1,71) = (q2, r2), which means qı = q2 and rı = rz (by the Ordered Pair 
Axiom). We first prove q = q2 by contradiction. Assume q1 Æ q2, and derive a contradiction. 
Combining the two equations for a assumed earlier, we get bqı +r, = bq2 +r2. Some algebra 
transforms this to b(qı — q2) = r2 — rı. Since qı and q2 are unequal integers, |qi — g2| > 1, 
and hence |r2 — rı| = |6||¢1 — q2| > |b| - 1 = |b|. So |r2 — rı| > |b|. On the other hand, the 
assumed inequalities for rı and ro combine to show that rı — rə < |b| and rə — rı < |]. 
Since |r2 — rı| equals rg — rı or rı — r2, we therefore have |r2 — rı| < |b]. We have reached 
the contradiction “|r2 — rı| > |b| and |r2 — rı| < |b|.” Therefore, q1 = q2. Now, to prove 
Tı = r2, rewrite the assumed equations for a to get rı = a — bq, = a — bqg = 12. 


4.37. Remark. Consider the special case of the Division Theorem where b = 2. On one 
hand, the existence part of the theorem says that any a € Z can be written in the form 
a = 2q+r where 0 < r < 2. In other words, for every a € Z, either there exists q € Z with a = 
2q+0, or there exists q € Z with a = 2q+ 1. This says that every integer a is even or odd. On 
the other hand, one consequence of the uniqueness of (q,r) for a is that the remainders 0 and 
1 cannot both work for the same fixed integer a. So, no integer a is both even and odd. This 


gives another proof of the theorem that | every integer is either even or odd, but not both |, 


which we used frequently in earlier sections. 

We can obtain similar results for other specific choices of b. For example, letting b = 3, 
we see that every integer a can be written in exactly one of the forms a = 3q, a = 3q+1, or 
a = 3q + 2 for some integer q. Similarly, taking b = 10, we see that every integer a has the 
form a = 10q +r for unique q,r € Z with r € {0,1,2,3,4,5,6, 7,8, 9}. This observation is 
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the first step in rigorously proving the existence and uniqueness of the decimal expansions 
(or more generally, the base-b expansions) of integers. More details on this process appear 
in some of the exercises below. 


Section Summary 


1. Prime and Composite Numbers. An integer n > 1 is composite iff n = ab for some 
integers a,b with 1<a<nand1<b<n. An integer p > 1 is prime iff p is not 
composite iff the only positive divisors of p are 1 and p. 


2. Facts about Primes. Every integer n > 1 can be written as a product of one or 
more primes. There are infinitely many primes. 


3. Integer Division Theorem. For all integers a and b with b Æ 0, there exist unique 
integers q and r such that a = bq +r and 0 < r < |b|. We call q the quotient and 
r the remainder when a is divided by b. Taking b = 2, we see that every integer 
is either even or odd, but not both. 


Exercises 


1. (a) List all primes less than 100. (b) Write each integer as a product of primes: 
91, 96, 111, 3003, and 8000000. 


2. For each a and b below, find the quotient and remainder when a is divided by b. 
(a) a = 58, b= 11 (b) a = 58, b = —11 (c) a= —58, b = 11 (d) a = —58, b = —11 
(e) a = 0, b = 11 (f) a = 6, b = 11 (g) a = —6, b = 11 (h) a = 6, b = —11. 

3. For each a and b below, find the quotient and remainder when a is divided by b. 
(a) a = 300, b = 27 (b) a = 9103, b = 11 (c) a = 4574, b = 19 (d) a = 100000, 
b = 123. 


4. Use Definition 4.28 to prove that an integer p > 1 is prime iff the only positive 
divisors of p are 1 and p. 


5. Prove: Vn € Zx0,Vm € Zo, if m is composite, then nm is composite. 


6. True or false? Prove your answers. 

a) The product of any two positive integers is always composite. 

b) The sum of two prime integers is never prime. 

c) The sum of any two composite integers is composite. 

d) The product of any two composite integers is composite. 

(e) Every composite integer has an even number of positive divisors. 


( 
( 
( 
( 


7. Suppose k € Zyo and p1,...,pp are integers. Give a careful proof that for all i 
between 1 and k, p; divides I Pr. (This fact was used in the proof of Theo- 
rem 4.30.) Suggestion: Prove p; divides [[}—; pr for all s in the range i < s < k 
by induction. 


8. (a) Prove that every prime integer other than 2 is odd. (b) Prove that every 
positive integer is either a power of 2 or is divisible by an odd prime. 


9. Use the uniqueness assertion in the Integer Division Theorem to prove carefully 
that 5 does not divide 22, without assuming in advance that 22/5 is not an integer. 


10. Use the technique in the previous exercise to give a fully rigorous proof that 7 is 
prime. (Use proof by exhaustion to show that for all b € {2, 3,4,5,6}, b does not 
divide 7.) 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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The following statements resemble the Integer Division Theorem, but they are 
all false. Negate and disprove each statement. 

(a) Va € Z, Vb € ZA! (q,r) E Z x Z, (a =bq+r^0< r< dj). 

(b) Va € Z, Vb € Z40,3! (q,r) EZ x Z, (a =bq+r^A0<r<b). 

(c) Va € Z, Yb € Zuo, AN (q,r) E€ Z x Z, (a =bq+r^0<r< dl). 

(d) Va € Z, Vb € Zz, 4! (q,r) € Z>0 x Z>0, (a = bq +r ^0 <r < dl). 


What are the possible remainders when a prime integer p > 10 is divided by 10? 
Prove your answer. 


What are the possible remainders when a square integer n? is divided by 8? Prove 
your answer. 


(a) Prove or disprove: for all open sentences P(q,r), 


B! r)EZxZ, Plg, r] = Blez, 3!r €Z, P(q,r)]. 


(b) Prove or disprove: for all open sentences P(q, r), 


4!qe€Z,A!reZ,P(q¢r)) = B!(q,r) €Z xZ, P(q,r)]. 


(a) For a,b € Zo, how are the quotient and remainder when a is divided by b 
related to the quotient and remainder when a + 1 is divided by b? (b) Prove the 
preliminary version of the Integer Division Theorem by fixing b > 0 and using 
ordinary induction (not strong induction) on a. 


Prove this variation of the Integer Division Theorem: for all a,b € Z with b Æ 0, 
there exist unique integers q,r with a = bq +r and —|b|/2 < r < |b|/2. 


Fix ro € Z. State and prove a version of the Integer Division Theorem where the 
remainder r is required to satisfy ro < r < ro + |b]. 


For a € Z, let a mod 4 € {0,1,2,3} be the remainder when a is divided by 4. 
(a) Prove by induction: for all n,a1,a2,...,an € Zso, if a mod 4 = 1 for all i, 
then ([]j_, a;) mod 4 = 1. (b) Prove there exist infinitely many primes p such 
that p mod 4 = 3. [Adapt the proof of Theorem 4.30.] 


Decimal Expansions of Integers. (a) Prove by induction that every positive in- 
teger n can be written in the form n = Soy 9 dx 10* where m > 0, each 
dy € {0,1,2,...,9}, and dm 40. (b) Prove that m and do, d,,...,dm in part (a) 
are uniquely determined by n. 

State and prove a generalization of the previous exercise in which 10 is replaced 
by an arbitrary base b € Z1. 

(a) Prove: for all z,c, d € Zso, 2°4 — 1 = (x° — 1) a ger, 

(b) Suppose a > 0, b > 0, and dividing a by b yields quotient q and remainder r. 
For fixed x € Zs, find (with proof) the quotient and remainder when x° — 1 is 
divided by x? — 1. 
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4.5 Greatest Common Divisors 


In this section, we study the greatest common divisor (gcd) of two integers a and b, denoted 
gcd(a, b). We present Euclid’s algorithm for computing gcd(a, b) based on repeated division. 
One consequence of this algorithm is the fact that gcd(a,b) can always be written in the 
form ax + by for some integers x and y. This fact has important applications in modern 


cryptography. 


Greatest Common Divisors 


Before discussing greatest common divisors, we recall the definition of divisibility. 


4.38. Definition: Divisors. For all integers a and c,|c is a divisor of a iff} du € Z,a = cu |. 


When this condition holds, we also say that | c divides a | and | a is a multiple of c |. 


The notation | cla] means that c divides a. 


Note that cla stands for the statement “c divides a,” whereas c/a denotes the rational 
number obtained when c is divided by a. Take care not to confuse these two symbols! 


4.39. Definition: Greatest Common Divisors. Given integers a,b that are not both 
zero, the greatest common divisor gcd(a, b) is the largest integer d such that d divides a and 
d divides b. We also define gcd(0, 0) = 0. 


As a technical aside, it can be shown that gcd(a, b) exists and is unique, using the fact 
that all divisors d of a nonzero integer a satisfy d < |a|; see Theorem 8.58(b). 


4.40. Example. Let a = 88 and b = 60. The positive divisors of a are 1, 2, 4, 8, 11, 22, 44, 88. 
The positive divisors of b are 1,2,3,4, 5,6, 10, 12,15, 20, 30,60. Comparing these lists, we 
see that the positive integers d dividing both a and b are 1, 2, and 4. So gcd(88, 60) = 4. 
Another way to compute greatest common divisors is to compare prime factorizations. Here, 
88 = 23-11 and 60 = 2?- 3-5. Both 88 and 60 are divisible by two copies of the prime 2 
but have no other common prime factors, so we conclude that gcd(88, 60) = 2? = 4. (See 
Theorem 4.60 for a justification of this technique.) 


It turns out that the two methods of finding greatest common divisors just discussed 
— listing all the divisors, or comparing prime factorizations — are horribly inefficient for 
large integers a and b. A much faster algorithm was discovered thousands of years ago by 
Euclid, the author of the famous geometry textbook Elements (which is also a text on 
number theory). The basis for this algorithm is the following result relating integer division 
to greatest common divisors. 


4.41. Theorem on Greatest Common Divisors. (a) For all a € Z, gcd(a,0) = Jal. 
(b) For all a,b,q,r € Z, if b #0 and a = bq + r, then ged(a, b) = gcd(b, r). 
Proof. (a) Fix a € Z. We know a = 0 or a £ 0, so consider cases. 
Case 1. Assume a = 0. Here, gcd(a,0) = 0 = |a| by definition. 
Case 2. Assume a Æ 0. We know d|0 for all integers d, since 0 = d-0. Therefore, the common 
divisors of a and 0 are exactly the same integers as the divisors of a itself. We know the 
largest divisor of a is |a|, so gcd(a, 0) = |a| in this case. 

(b) Fix a,b,q,r € Z. Assume b Æ 0 and a = bq + r. Prove gcd(a,b) = ged(b,r). Our 
strategy is to prove that the set of common divisors of a and 6 coincides with the set of 
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common divisors of b and r; since it then follows that the largest element of the first set 
(namely gcd(a, b)) must equal the largest element of the second set (namely gcd(b,1r)). 

To show these two sets are equal, fix an integer d; we prove that d|a and d|b iff d|b and 
d|r. For one direction, assume d|a and d|b, so a = ds and b = dt for some s,t € Z. We 
must prove d|b and d|r. We already assumed d|b. Also, r = a — bq = ds — dtq = d(s — tq) 
where s — tq is an integer, so dir. For the other direction, assume d|b and dr, so b = dt 
and r = du for some t,u € Z. We must prove d|a and d|b. We already assumed d|b. Also, 
a = bq +r = dtq + du = d(tq + u) where tq + u is an integer, so dla. 


Euclid’s GCD Algorithm 


The previous theorem justifies the following recursive algorithm for computing gcd(a, b), 
called Euclid’s Algorithm. 


4.42. Euclid’s Recursive GCD Algorithm. 

Input: a,b € Zso. 

Output: gcd(a, b). 

Procedure: (a) If b = 0, return a. 

(b) If b > 0, use integer division to write a = bq + r for some q,r € Z with 0 < r < b; 
return gcd(b, r). 


It can be proved, using strong induction on b, that this algorithm terminates with the 
correct answer after finitely many steps. The key observation is that the recursive call in 
step (b) replaces the second input b by the smaller nonnegative integer r. 


4.43. Example. Let us recompute gcd(88,60) using Euclid’s algorithm. Initially a = 88 
and b = 60, so we are in case (b) of the algorithm. We divide a by b, getting: 


88 = 60-1 + 28; gcd(88, 60) = gcd(60, 28). 
In the first recursive call, the inputs are 60 and 28. Dividing again gives 
60 = 28 - 2 + 4; gcd(60, 28) = gcd(28, 4). 
The new inputs are 28 and 4; dividing gives 
28 = 4-7 +0; gcd(28, 4) = gcd(4, 0). 


By case (a) in the algorithm, gcd(4, 0) = 4. Working our way back up through the recursive 
calls, we see gcd(88, 60) = 4. 


Although we described the gcd algorithm recursively, the previous example shows how to 
unravel this recursion into an iterative procedure consisting of a series of integer divisions. 
Specifically, assuming we start with positive inputs a and b, the algorithm performs the 
following divisions: 


a = bqa+rn, 0< rı <b; 
b = 1424+ 12, 0 < r2 < ri; 
rı = T2q3 +T3, 0 < r3 < T2; 
r2 = T3q4+fT4, 0 < r4 < r3; (4.1) 
Tk-2 = Tk-1qk+fTk, 0 < Tk < Tk-1; 
Tk-1 = Tkqk+1 +0, (remainder r41 is zero). 


When we finally reach a remainder rk} = 0, we return gcd(a,b) = rg, the last nonzero 
remainder. This must eventually occur, since b > rı > rg > r3 >--- and all remainders are 
nonnegative integers. 
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4.44. Example. Let us use the algorithm to compute gcd(171, 51): 


171 = 51-3418, (4.2) 
51 = 18-2415, (4.3) 
18 = 15-143, (4.4) 
15 = 3-5+0. (4.5) 


So ged(171, 51) = 3. 


Linear Combination Property of GCDs 


One of the most important properties of greatest common divisors is that for all a,b € Z, 
gcd(a,b) can be written in the form az + by for some integers x and y. In fact, we can 
enhance Euclid’s algorithm with some extra computations that find x and y explicitly. The 
idea is to work backwards through the chain of divisions to express gcd(a, b) (which is the 
last nonzero remainder) in terms of quantities appearing earlier. We illustrate this idea by 
revisiting the two previous examples. 


4.45. Example. We computed gcd(88, 60) = 4; let us now find x, y € Z with 4 = 88x+60y. 
Recall the divisions executed by the recursive algorithm to find gcd(88, 60): 


88 = 60-1 + 28; 60 = 28-244; 28 =4-7+0. 


The gcd 4 appears as the remainder in the second equation. Isolating the 4, we see that 
4 = 60 + 28- (—2). Now, from the first equation above, 28 = 88 — 60. Substituting this 
expression for 28 into our expression for 4, we find that 


4 = 60 + 28 - (—2) = 60 + (88 — 60) - (—2) = 88- (—2) + 60-3. 


The last step used the distributive law to collect terms; note that (88 —60)-(—2) contributes 
(—1)(—2) = 2 copies of 60, and there is one more copy of 60 added in front, for a total of 
3. We now see that 4 = 88x + 60y holds if we choose « = —2 and y = 3. 


Before looking at the next example, let us introduce some terminology. 


4.46. Definition: Linear Combinations. For all d, s,t € Z, 
d is a linear combination of s and t| iff |} da,y € Z,d = sx + ty |. 


4.47. Example. Let us find x,y € Z with 3 = gcd(171,51) = 171x + 5ly. Our strategy 
is to use the divisions (4.4), (4.3), (4.2) to express 3 as a linear combination of 18 and 15, 
then as a linear combination of 51 and 18, and finally as a linear combination of 171 and 
51. First, from (4.4), 

3=18-1+415-(-1). 


Now from (4.3), 15 = 51 + 18 - (—2). Substituting into the previous equation and collecting 
terms, 


3 =18-1+ [51 +18- (—2)]-(—1) = 18-3+51-(-1). 


Note that the new coefficient of 18 was computed from the distributive law as 1+(—2)(—1) = 
3. Now from (4.2), 18 = 171+51-(—3). Substituting into the previous equation and collecting 
terms, 


3 = [171 + 51- (—3)] -3 +51 - (—1) = 171 - 3 + 51 - (—10). 


Note that the new coefficient of 51 was computed from the distributive law as (—3) - 3 + 
(—1) = —10. So 3 = 171x + 5ly holds if we choose x = 3 and y = —10. 
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In summary, to find x and y with gcd(a, b) = ax + by: first find the gcd using repeated 
divisions as in (4.1); then work backwards to express the gcd rą as a linear combination of 
rk-2 and rp—1, then as a linear combination of r,_3 and rz—2, and so on, until finally rk 
has been expressed as a linear combination of the original inputs a and b. We do one more 
complete example to illustrate the full process. 


4.48. Example. Find gcd(999, 101), and find z,y € Z with gcd(999, 101) = 999x + 101y. 
Solution. First find the gcd by repeated divisions: 


999 = 101-9 + 90, 
101 = 90-1411, 


90 =11-8+2, 
11=2-5+1, 
2=2.1+0. 


So gcd(999, 101) = 1. Next, work backwards through the equations, taking care to collect 
common terms at each stage, as shown in the far right column below: 


1 =11+2-(—5) 


= 11 + [90 + 11- (—8)] - (—5) = 11-41 +90- (—5) 
= [101 + 90 - (—1)] - 41 + 90- (—5) = 101 - 41 +90- (—46) 
= 101 - 41 + (999 + 101 - (—9)) - (—46) = 101-455 + 999 - (—46). 


So 1 = 999x + 101ly holds with x = —46 and y = 455. 


We prove the linear combination property of gcds in §4.6. 


Matrix Reduction Algorithm for Computing GCDs (Optional) 


In this optional section, we present another algorithm for computing d = gcd(a,b) and 
finding integers x, y such that d = ax + by. This algorithm proceeds by performing certain 
elementary row operations on matrices with integer entries. There are three allowable row 
operations: (i) switch two rows in a matrix; (ii) multiply one row in a matrix by —1; (iii) add 
any integer multiple of one row of a matrix to a different row. 

You may have studied similar elementary row operations for real-valued matrices in 
linear algebra. In the real case, we can add any real multiple of one row to another row in 
operation (iii), and we can multiply a given row by any nonzero real number in operation 
(ii). These row operations are used in the Gaussian elimination algorithm to solve a system 
of linear equations by row-reducing the augmented matrix for the system. 

Our new algorithm for computing gcd(a, b) is analogous to Gaussian elimination, but 
we are only allowed to use the integer versions of the elementary row operations. Given 
nonzero integers a and b as input, we begin with the matrix 


1 Oja 

0 1) 6b J’ 
Next we perform a sequence of row operations whose goal is to produce a zero in the 
rightmost column. For example, we could start by dividing a by b, obtaining a quotient qı 


and a remainder rı such that a = bq, +1; and 0 < rı < |b|. Adding —q,; times row 2 to row 
1 produces the new matrix 

1 =q |r 

0 1} b? 
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If rı is nonzero, we could continue by dividing b by r1, obtaining q2 and rə with b = rıq2 +r2 
and 0 < rg < |r,|. Adding —q2 times row 1 to row 2 leads to the matrix 


rı 

T9 i 

We can continue in this way, following the same sequence of division steps used in (4.1) to 
row-reduce the matrix. We ultimately arrive at a matrix of the form 


T Y|Tk 
u v| 0’ 
where a zero has appeared in the rightmost column. Remarkably, the entries x, y, and rk 


in the other row now satisfy re = gcd(a, b) = ax + by. A proof of this assertion is outlined 
in the exercises. 


1 —=qı 
—q2 1+qq2 


4.49. Example. We illustrate the matrix reduction algorithm and its connection to the 
previous algorithm by solving Example 4.48 once again. The row-reduction steps are shown 
here: 


1 0|999 | R,-9R2} 1 —9} 90 Ro-R 1 —9 | 90 | R,-8Re 9 —89| 2 
1 0 1 | 101 —1 10/11 1 


Ro-5 Ra 9 —89 | 2 Rı—2Rə 101 —999 | 0 
—46 455] 1 —46 455 | 1 


Looking at row 2, we read off 1 = gcd(999, 101) = 999 - (—46) + 101 - 455. 


The matrix reduction algorithm is really the same as Euclid’s algorithm based on re- 
peated division, but the matrix formulation has several advantages. First, the algorithm 
both computes the gcd and writes it as a linear combination of the original inputs in a 
single computation; there is no need to backtrack through a chain of divisions to find x and 
y. Second, when executing the original algorithm by hand, it is easy to make arithmetic 
errors when substituting expressions into other expressions and using the distributive law. 
The matrix algorithm executes the same arithmetic via row operations, where mistakes are 
less likely. Third, we are free to use any row operations we like, not just those based on the 
divisions in (4.1). By choosing row operations wisely, we can sometimes reach the goal in 
fewer steps, as seen in the next example. 


4.50. Example. We compute gcd(119, 21) as follows: 


1 Of; 119 Ri -6Re 1 -6|-7 Rot3Ri 1 -6]|-7 Rix51 —1 6/7 
0 1| 21 0 14 21 3 —-17/ 0 3 -17/0 |` 


We see that ged(119, 21) = 7 = 119 - (—1) + 21-6. By subtracting six copies of Rs at stage 
1 instead of five copies, we made the gcd appear more quickly (up to a sign, which is easily 
removed). As additional savings, the row operation adding three copies of row 1 to row 2 is 
not really needed, once we notice that —7 divides 21 in the last column. 


Section Summary 
1. Facts about Divisors. For integers a and c, the notation cla means c divides a 
(i.e., Ju € Z,a = cu). For all a,b,c, z,y € Z, c divides a iff |c| divides |al; if c 
divides a and a > 0, then c < a; if cla and c|b, then c|(ax + by). 
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2. Greatest Common Divisors. For integers a and b (not both zero), gcd(a, b) is 
the largest integer d such that dja and d|b. We have gcd(a,b) = gcd(|al, |b]); 
gcd(a,0) = |a|; and if a = bq +r for some integers q,r, then gcd(a, b) = gcd(b, r). 
We can always write gcd(a,b) = ax + by for some integers x,y; we say that 
gcd(a, b) is a linear combination of a and b. 


3. Euclid’s GCD Algorithm. To find the ged of positive integers a and b, divide a by 
b to get a remainder r1; divide b by rı to get a remainder r2; divide rı by r2 to get 
a remainder r3; and so on. The last nonzero remainder rẹ is gcd(a, b). Working 
backwards through the divisions, we can express rz as a linear combination of 
rp—1 and rp—g, then as a linear combination of rk—2 and rz_3, until eventually rk 
has been written as a linear combination of the original inputs a and b. 


4. Matrix Reduction Algorithm for GCDs. To find the gcd of positive integers a and b, 
1 Oja 
0 1) 6 
the other row (or multiply one row by —1) until a zero appears in the last column. 
If the other row has entries z, y,d with d > 0, then d = gcd(a, b) = ax + by. 


start with the matrix 


| . Repeatedly add integer multiples of one row to 


Exercises 


1. For each pair a,b below, compute gcd(a, b) using repeated division. 
(a) a = 98, b = 21 (b) a = 228, b = 168 (c) a = 100, b = 39 (d) a = 513, 
b = 252. 

2. For each pair a,b in the previous problem, find integers x,y with gcd(a,b) = 
ax + by by working backwards through the chain of divisions. 

3. For each pair a,b below, compute d = gcd(a, b) and integers x,y with d = ax + by 
using the matrix reduction algorithm. 
(a)a=144,b=89 (b)a=516,b=215 (c) a= 111111, b = 117845. 

4. Prove: for all a,b € Z, gcd(a, b) = gcd(|al, |b|). 


5. Explain how Euclid’s algorithm computes gcd(a, b) in each of these special cases: 
(a) a = b = 0; (b) a = 0 Æ b; (c) b divides a; (d) a divides b. 

6. Prove by strong induction on b that the recursive Algorithm 4.42 always termi- 
nates in finitely many steps with the correct answer. 


7. Use induction on n to prove: for all integers n > 0,d,21,...,%n,@1,---,@n, if dla; 
for all i € {1,2,...,n}, then d|(a1xı +--+ + anzn). 

8. (a) Prove: for all positive integers a,b, gcd(b, a) = gcd(a, b) = gcd(a — b, b). 
(b) Use part (a) to formulate a new recursive algorithm for computing gcds. 
Compare this algorithm to Euclid’s gcd algorithm. 


9. Prove or disprove: for all positive integers a,b, gcd(a, b) = gcd(a + b,a — b). 
10. Suppose p is prime and a € Z. What are the possible values of gcd(a, p)? Under 
what conditions on a does each value occur? 


11. Prove: for all p,a,b € Z, if p is prime and p divides ab, then p divides a or p divides 
b. [Use the previous exercise and the fact that gcd(a, p) is a linear combination 
of a and p.] 


12. For fixed n € Zo, find all possible values of gcd(8n + 1, 5n — 3) and determine 
when each value occurs. Also write the gcd as a linear combination of 8n + 1 and 
bn — 3. 
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13. (a) Prove: for all n € Zo, ged(n3,n? +n +1) = 1. (b) Write 1 as an explicit 
linear combination of n? and n? +n +1. 


14. Prove: for all positive integers a, b, c, gcd(ca, cb) = c-gcd(a, b). [Hint: Examine how 
Euclid’s Algorithm acts on inputs ca,cb compared to inputs a,b by multiplying 
each line of (4.1) by c] 


15. Define the Fibonacci numbers by Fo = 0, Fy = 1, and F, = F,_-1 + Fy_2 for all 
n > 2. (a) Prove by induction: for all n > 1, gcd(Fn, Fn-1) = 1. (b) Forl <n < 6, 
express 1 as a linear combination of F and F;,-1. (c) Based on your answers to 
(b), conjecture and then prove a formula (valid for all n > 1) expressing 1 as a 
linear combination of F, and F,_1. 


16. Suppose a and b are fixed integers, A = | a i is a matrix with integer 
g! y' z! 
entries such that ax+by = z and au+bv = w, and B = | W a! | w' is obtained 


from A by performing one of the elementary row operations (i), (ii), or (ii) 
described on page 188. Prove that the entries of B are all integers, ax’ + by’ = 7’, 
au’ + bu’ = w', and ged(z, w) = ged(z', w’). 


17. Suppose a and b are fixed integers, Ao = ; A | i , and Ag, Aj, Ao,..., Ax is 
any finite sequence of matrices such that for 0 < i < k, Ai+1 is obtained from A; 


by an elementary row operation for integer matrices. Write A; = i align 


Use induction on k and the previous exercise to prove that for all 7 between 0 
and k, ax; + by; = zi, au; + bv; = wi and gcd(z;, wi) = gcd(a, b). 


18. Prove that the matrix reduction algorithm for finding gcd(a, b) always terminates 
in finitely many steps with the correct answer, assuming that at each step we 
perform any elementary row operation that reduces the maximum absolute value 
n of the two entries in the third column. (Use strong induction on n.) Also explain 
why, at each step before the end, at least one such row operation must be possible. 


19. Given a,b € Z>o, write a = bq +r where q,r € Z and 0 <r < b. Prove: for all 
x € Z>1, gcd(x* — 1, 2° — 1) = ged(z’ — 1, x” — 1). (Hint: See Exercise 21 in §4.4.) 


20. Use the previous exercise to prove: for all positive integers a,b,x with x > 1, 
ged(a® — 1, 2° — 1) = 84th) — 1, 
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4.6 GCDs and Uniqueness of Prime Factorizations 


In the last section, we looked at specific examples where gcd(a,b) could be expressed in 
the form ax + by for certain integers x and y. Here we prove that such expressions always 
exist. We then use this result to obtain further facts about prime integers. In particular, we 
prove that the factorization of a positive integer into a product of primes is unique except 
for rearranging the order of the factors. 


Proof of the Linear Combination Property of GCDs 


We now prove the linear combination property of gcds for all integers a,b > 0; an exercise 
asks you to generalize to the case where a or b could be negative. 


4.51. Theorem on Writing GCDs as Linear Combinations. For all integers a,b > 0, 
there are integers x and y such that gcd(a, b) = ax + by. 

Proof. To clarify the structure of the induction proof, we write the theorem statement in 
formal symbols as follows: 


Vb € Z>0, Va € Z50, da € Z, Jy € Z, gcd(a, b) = ax + by. 


We prove this statement by strong induction on b. Fix an arbitrary integer b € Z>o. [Next 
we must state the strong induction hypothesis. We change to new letters here to avoid a 
notation conflict between the assumed statement and the statement to be proved.] Assume: 
for all integers m in the range 0 < m < b, 


Ve € Z>0, ds € Z, 3t € Z, gcd(c,m) = cs + mt. (4.6) 


We must prove 


Va € Z>0, 3x € Z, Ay E Z, gcd(a, b) = ax + by. (4.7) 
We know b = 0 or b > 0, so consider two cases. 
Case 1. Assume b = 0; prove (4.7). Fix an arbitrary integer a € Z>ọ; prove Jx € Z, Jy € 
Z, gcd(a, 0) = ax + 0y. We know gcd(a, 0) = a, so choose x = 1 and y = 0. Note 1 and 0 are 
in Z, and gcd(a,0) =a=a-1+0-0. 
Case 2. Assume b > 0; prove (4.7). Fix an arbitrary integer a € Z>ọ; prove dx € Z, Jy € 
Z,gcd(a,b) = ax + by. Because b > 0, we can divide a by b, obtaining integers q,r with 
a = bq +r and0 < r < b. By the Theorem on Greatest Common Divisors, we know 
gcd(a,b) = gcd(b,r). The key observation is that 0 < r < b, so that we are allowed to 
choose m = r in the induction hypothesis. By the Inference Rule for ALL, we can take 
c = b in (4.6), so that the induction hypothesis tells us there are integers s and t with 
gcd(b, r) = bs + rt. We also know r = a — bq, and hence 


gcd(a, b) = gcd(b,r) = bs + rt = bs + (a — bq)t = at + b(s — qt). 


So, choosing x = t and y = s — qt (which are integers since s,q,t € Z), we do have 
gcd(a, b) = ax + by, as needed. This completes the induction proof. 


We can translate this proof into a recursive algorithm that takes inputs a,b € Z>9 and 
returns d, x,y € Z such that d = gcd(a, b) = ax + by. The base case of the algorithm occurs 
when b = 0; then we return d = a, x = 1, and y = 0. Otherwise, when b > 0, divide 
a by b to get q,r € Z with a = bq +r and 0 < r < b. Recursively compute d,s,t with 
d = gcd(b,r) = bs + rt; and return d as the gcd, x = t, and y = s — qt. Tracing through 
all the recursive calls, you can check that the net effect of this recursive algorithm agrees 
with the iterative computation presented in the last section, where we worked backwards 
through the chain of divisions to find x and y. 
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Properties of Prime Numbers 


In §4.4, we proved that for all n € Zy1, there exist factorizations of n into products of 
one or more primes. Our next major goal is to prove a uniqueness theorem about prime 
factorizations. The uniqueness of the factorizations is harder to prove than their existence, 
so we must first develop some facts about prime numbers. Recall that p € Zs is prime iff 
the only positive divisors of p are 1 and p. 


4.52. Euclid’s Lemma on Divisibility by Primes. For all p,a,b € Z, if p is prime and 
p divides ab, then p divides a or p divides b. 

Proof. Fix p,a,b € Z. Assume p is prime and p divides ab. Prove p divides a or p divides b. 
[Use the OR-template to continue.] Assume p does not divide a; prove p divides b. We have 
assumed p divides ab, which means ab = pc for some integer c. We must prove dd € Z, b = pd. 
[How do we proceed? We need to use our other assumptions: p is prime and p does not divide 
a. A key idea is that these assumptions allow us to compute gcd(p, a).] Since p is prime, the 
only positive divisors of p are 1 and p. Thus the greatest common divisor of p and a could 
only be 1 or p. Since we assumed p is not a divisor of a, we must have gcd(p,a) = 1. We 
now invoke Theorem 4.51 to conclude there exist integers x and y with 1 = ax + py. [We 
need one more trick that does not come from the proof templates: multiply both sides by 
b, to make ab appear on the right side.] Multiplying by b, using algebra, and recalling that 
ab = pc, we get 


b = (ax + py)b = axb + pyb = (ab)x + pyb = (pe)a + pyb = p(cx + yb). 
Choosing d to be the integer cx + yb, we have b = pd, as needed. 


4.53. Theorem on Divisibility by Primes. For all k € Zs, and all p, a1, a2,...,ax € Z, 
if p is prime and p divides aja2---ax, then for some i € {1,2,...,k}, p divides aj. 

Proof. We use ordinary induction on k. 

Base Case. Prove that for all p, a; € Z, if p is prime and p divides a1, then for some i € {1}, 
p divides a;. This statement is readily proved (choose i = 1). 

Induction Step. Fix an arbitrary k € Z>1; assume the theorem statement holds for this k; 
prove the theorem statement holds for k + 1. Fix p,aj,...,@x%,@¢41 € Z; assume p is prime 
and p divides a, ---@,ax~41; prove there exists i € {1,2,...,4 + 1} such that p divides a;. 
The key realization is that the number a, ---a,ax4 1 can be regarded as the product of the 
two integers a = a,---a, and b = ag44. Since p divides ab by assumption, the previous 
lemma applies to show that p divides a or p divides b. Now use cases. 

Case 1. Assume p divides a. This means p divides a a2---a,. By induction hypothesis, there 
exists i € {1,2,...,k} such that p divides a;, as needed. 

Case 2. Assume p divides b, so p divides az41. Choose i = k+1; then p divides a;, as needed. 


Uniqueness of Prime Factorizations 


Now we are ready to prove the uniqueness of prime factorizations. The next result is some- 
times called the Fundamental Theorem of Arithmetic. 


4.54. Theorem on Existence and Uniqueness of Prime Factorizations. 

(a) Every integer n > 1 can be written as the product of one or more primes. 

(b) For all n € Zyy, all s,t € Z>1, and all primes p1,...,ps,q1,---, 4, ifn = pipe---ps = 
M142°*:q@, then s = t and we can reorder the q; so that for all i € {1,2,...,s}, Pi = qi- 
Proof. Part (a) was proved in §4.4. We prove part (b) by strong induction on n. Fix an 
integer n € Zs, and assume part (b) holds for all n’ in the range 1 < n’ < n. Prove 
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the statement in part (b) holds for n. Fix s,t € Zs, and primes pi,...,Ps,@1,---;@:, and 
assume E 

n = P1p2*** Ps = 0102: tt. 
We know s = 1 or t = 1 or (s > 1 and t > 1), so use cases. 
Case 1. Assume s = 1. Then n = pı = qı: q+. In this case, t must be 1, since otherwise 
pı = qı (q2: q+) expresses pı as a product of the integers qı and q2- - q, which are both 
strictly between 1 and p,. This is impossible, since pı is prime. Knowing that t = 1, we now 
have s = t = 1 and p Sn = qı. 
Case 2. Assume t = 1. As in case 1, we see that s = 1 (because qı is prime), so s = t and 
Pı SNn= qi. 
Case 3. Assume s > 1 and ¢ > 1. On one hand, since n = pı (p2 pPs), we see that pı divides 
n. On the other hand, n = qı ++- qi, so the prime pı divides the product q,---q. By the 
Theorem on Divisibility by Primes, pı divides some q; where 1 < j < t. By reordering the 
qi if needed, we can arrange that pı divides qı. But qı is prime, so its only positive divisors 
are 1 and qı. As pı > 1, we must have pı = q1. Now we know 


pı (p2: Ps) =N = qq: Ge = piq qe). 


Dividing both sides by the nonzero integer pı, we obtain a new integer 


n’ = p2: -ps = 02`: qt. 
Note n’ < n since pı > 1, whereas n’ > 1 since s > 1 and ps > 1. Thus, n’ is an integer in 
the range 1 < n’ < n, so we can apply the induction hypothesis to n’. We have written n’ 
as the product of s — 1 primes pə2---ps and also as the product of t — 1 primes q2- -- qe. By 
the induction hypothesis, we conclude that s — 1 = t — 1 and q2,...,q is a rearrangement 
of p2,...,pPs. Since also pı = qı, we see that s = t and q1,...,q¢ is a rearrangement of 
P1,- --, Ps. This completes the induction proof. 


The Fundamental Theorem of Arithmetic can be rephrased as follows. 


4.55. Theorem on Prime Factorization of Integers. Let pı < po < p3 < --- be 
a list of all the distinct primes. For every nonzero integer n, there exist unique integers 
s € {+1,—1} and e1,€2,...,€j,... € Z>o such that all but finitely many e; are zero, and 


= €1 E2 ej c ej 
n = Spi Po ``: P; e= s [[ p7. 
j>1 


Proof. Let n be a fixed nonzero integer. We know n > 1 or n = 1 or n < 0, so use cases. 

Case 1. Assume n > 1. To prove existence of the factorization, use the earlier theorem to 
write n as a product of one or more primes, say n = q1g2---qz. Now choose s = +1, and 
for all 7 > 1, let ej be the number of times the prime p; appears in the list q1, q2,- --, qk- 
Here, all but finitely many e; are zero, and n = s] [ j>1 p7 since multiplication of integers 


is commutative. To prove uniqueness, suppose we also had n = s' [[ j>1 pr where s’ and e} 
satisfy the conditions in the theorem statement. We must have s’ = +1 = s since n > 0 and 
every p; > 0. Ifsome ej were unequal to ej, then we would have two prime factorizations n = 


Ll;>ı p; =TTjs1 p7 in which the two lists of primes were not rearrangements of each other, 
because p; occurs a different number of times in the two lists. This violates the uniqueness 
property in the earlier theorem. So ej = ej for all j, completing the proof of Case 1. 

Case 2. Assume n = 1. For existence, choose s = 1 and every ej = 0. For uniqueness, note 
that this is the only choice of s and e; that produces a positive product less than 2. 

Case 3. Assume n < 0. This case follows by applying the existence and uniqueness results 
already proved to —n > 0; the only required modification is to take s = —1 instead of 
s= +1. 
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Section Summary 


1. Linear Combination Property of GC'Ds. For all a,b € Z, there are integers x,y € Z 
with d = gcd(a,b) = ax + by. To find d,x,y when a > 0 and b = 0, let d = a, 
x = 1, y = 0. When a,b > 0, write a = bq +r for q,r € Z with 0 < r < b; 
recursively find d, s,t with d = gcd(b,r) = bs + rt; then gcd(a, b) = d, x = t, and 
y=s-qt. 

2. Properties of Primes. For prime p and a € Z, gcd(a, p) is p if p divides a and 
is 1 if p does not divide a. Whenever a prime p divides a product aiaz :--apg of 
integers, p must divide some a;. Every integer n > 1 can be written as a product 
of prime integers. The prime factorization is unique in the following sense: if 
n = pittt Ps = qıt: q With all p; and q; prime, then s = t and q,...,q@ is a 
rearrangement of p1, ..., Ps- 


3. Fundamental Theorem of Arithmetic. Existence and uniqueness of prime factor- 
izations can be reformulated as follows. Any nonzero integer n can be expressed 
uniquely in the form s]] j>1 p7 , where s is +1 or —1, the ej are nonnegative 
integers, all but finitely many e; are zero, and pı < pg < =: < pj < isa 
complete list of prime integers. 


Exercises 


1. For each a,b, find d = gcd(a,b) and integers x,y with d = ax + by using the 
recursive algorithm from this section. (a) a = 693, b = 525 (b) a = 999, b = 629 
(c) a = 34, b = 21. 

2. Extend Theorem 4.51 to the case where a or b could be negative. 

3. (a) Use the known identity gcd(a, b) = gcd(a — b, b) and strong induction on a + b 
to give another proof of Theorem 4.51. (b) Translate this proof into a recursive 
algorithm for finding d,z,y with d = gcd(a,b) = ax + by, and illustrate this 
algorithm on inputs a = 30, b = 8. 

4. Prove: For all a,b € Z, gcd(a,b) = 1 if and only if there exist x,y € Z with 
ax + by = 1. 

5. For all a,b,d € Z, we proved: if gcd(a, b) = d, then 3z, y € Z, ax + by = d. Is the 
converse of this statement always true? Prove your answer, and compare to the 
previous exercise. 


6. Prove: for all a,b,c € Z, c is a common divisor of a and b iff c divides gcd(a, b). 
[This conclusion strengthens the original definition of gcd(a,b), which initially 
tells us only that c is less than or equal to gcd(a, b).] 


7. Suppose p € Zs, is a fixed integer with the following property: for all a,b € Z, if 
p divides ab, then p divides a or p divides b. Must p be prime? Prove your answer. 


8. Prove: for all a,b,c € Z, if gcd(b, c) = 1 and clab, then cla. 


9. Prove: for all q € Q, there exists unique (m,n) € Z x Z with n > 0 and q = 
m/n and gcd(m,n) = 1. [This proves that every rational number has a unique 
representation as a fraction in lowest terms.] 

10. (a) Prove: for all r,s,t € Z, if gcd(r,t) = 1 = gcd(s,t), then gcd(rs,t) = 1. 
(b) Prove by induction on n > 0: for all integers r1,1r2,...,7n,¢, if gcd(r;,t) = 1 
for all i € {1,2,...,n}, then ged([]j_, rat) = 1. 
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11. 


12. 


13. 


14. 


15. 


16. 
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Give a new proof of Theorem 4.51 based on Exercise 18 of §4.1, as follows. Fix 
a,b € Zyo, and define S = {ax + by: x,y € Z} N Zo. Show that S has a least 
element d, and d = gcd(a, b). 


(a) Given a,b € Z, show that the integers x, y € Z such that gcd(a, b) = az+by are 
not uniquely determined by the pair (a, b). In fact, show that for each pair (a,b) € 
Z x Z, there are infinitely many pairs (x,y) € Z x Z with d = gcd(a, b) = ax + by. 
(b) Given fixed integers a,b, zo, yo with 1 = gcd(a,b) = azo + byo, explicitly 
describe (with proof) the set of all pairs of integers (x, y) solving 1 = ax + by. 

Prove that the iterative method for finding x,y with gcd(a, b) = ax + by (based 


on working backwards through the divisions in (4.1)) always gives the same x, y 
as the recursive algorithm described after the proof of Theorem 4.51. 


The gcd of a finite list of positive integers a1,...,@,, is the largest integer d such 
that dla; for all i between 1 and n. (It can be shown that such a d exists.) Use 
induction to prove: for all n € Z>2 and all a1,...,@n E€ Zso, ged(a1,...,an) = 
gcd(gcd(aq,..-,@n—1); Gn). 

Continuing the previous exercise, prove that for all positive integers n, a1,...,@n, 
there exist integers 21,...,@n with gcd(a1,...,@n) = a1%1 + +++ + anin. 
Describe an algorithm that takes aj,...,@, E€ Zsg as input and computes 


d,@1,...,%n E€ Z with d = gcd(ay,...,an) = a,x, +--+ + an£n. Prove that 
your algorithm terminates with the correct answer. 
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4.7 Consequences of Prime Factorization (Optional) 


This optional section explores some consequences of the Fundamental Theorem of Arith- 
metic. Recall that this theorem expresses each nonzero integer n in the form n = s[] j>1 p n 
where pı < po < +++ < pj < ++: is the list of all prime numbers, and the integers 
s € {+1,—1} and e€1,€2,...,€j,... E Zso are uniquely determined by n. Furthermore, 
only finitely many e; are nonzero. To emphasize that s and all e; depend on n and are 
uniquely determined by n, we write s = s(n) and e; = e;(n). So for all nonzero integers n, 


n= s(n) a”. 


j21 


Multiplication Rule for Prime Factorizations 


The following result shows how the prime factorization of the product of two nonzero inte- 
gers is related to the factorizations of each separate integer. Intuitively, the result reduces 
multiplication of nonzero integers to the addition of the vectors of exponents in the prime 
factorizations. 


4.56. Theorem on Products of Integers. For all nonzero n,m E Z, s(nm) = s(n)s(m) 
and e;(nm) = e;(n) + e;(m) for all j € Zs}. 
Proof. Fix nonzero n,m € Z. We have the factorizations 


Multiplying these together, we obtain 


nm = s(n)s(m) lee (4.8) 


j21 
On the other hand, we know the nonzero integer nm has a unique expression 


nm = s(nm) lie” (4.9) 


j21 


where s(nm) € {+1,—1}, each e;(nm) € Zo, and only finitely many values e;(nm) are 
nonzero. Since s(n)s(m) € {+1,—1}, every e;(n) + e;(m) is a nonnegative integer, and 
at most finitely many numbers e;(n) + e;(m) are nonzero, (4.8) satisfies the conditions 
required in (4.9). By uniqueness, we must therefore have s(nm) = s(n)s(m) and e;(nm) = 
e;(n) + e;(m) for all integers j > 1. 


Characterization of Divisors 


The next theorem shows how to use prime factorizations to detect when one integer is a 
divisor (or multiple) of another integer. 


4.57. Theorem on Factorization of Divisors. For all nonzero integers a and b, 

a is a divisor of b iff for all j € Zs1, e;(a) < e;(0). 

Proof. Fix a,b € Zz. First assume a is a divisor of b. Then there exists an integer c with 
b = ac; we have c # 0 since b Æ 0. Fix j € Z> . By the multiplication rule for prime 
factorizations, e;(b) = e;(a) + e;(c) > e;(a) +0, so e;(a) < ej(b). 
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Conversely, now assume that for all j € Zs1, e;(a) < e;(b). Prove alb, which means 
d € Z,b = ad. Choose 
e;(b)—e;(a 
d=s|] p” 5 ( ) 


j21 


Lu 


where s = 1 if a and b have the same sign, and s = —1 otherwise. Note that d is an 
integer, since the exponent of each p; is the nonnegative integer e,;(b) — e;(a), and at most 
finitely many of these exponents can be nonzero. On one hand, the choice of s shows that 
b and ad have the same sign. On the other hand, for all j € Z>1, e;(ad) = e;(a) + e;(d) = 
e; (a) + (e; (0) — e;(a)) = e;(b). So b = ad, as needed. 


Prime Factorization of GCDs and LCMs 


Our next result shows how to compute the greatest common divisor of two nonzero integers 
using their prime factorizations. First, we need one new definition. 


4.58. Definition: Minimum. For all x,y € R, define min(z,y) = x if £ < y, and 
min(z, y) =y if y< z. 


This definition says that min(x, y) is the smaller of the two numbers x and y. You are 
asked to prove the next lemma in the exercises. 


4.59. Lemma on the Minimum. For all x,y,z € R, z < min(a,y) if z < x and z < y. 


The next theorem shows that we can calculate greatest common divisors by comparing 
prime factorizations and taking the minimum exponent of each prime. 


4.60. Theorem on Prime Factorization of GCDs. For all nonzero integers a and b, 
d b) = f min(ej (a),e;(b)) 
gc (a, ) I;> Pj i 


Proof. Fix nonzero integers a and b, and let d = []js1P i()) First we show d is 
a common divisor of a and b. For every j € Zs1, e;(d) = min(e;(a), e;(b)) < e;(a), so dla 
by the Theorem on Factorization of Divisors. Similarly, since e;(d) < e;(b) for all j, d|b 
follows. To finish the proof, we show that d is the largest common divisor of a and b. Fix 
any integer f such that f|a and f|b. We prove that f < d by showing the stronger statement 
f\d. Because fla and f|b, e;(f) < e;(a) and e;(f) < e;(b) for all j > 1. By the Lemma on 
the Minimum, e;(f) < min(e,;(a),e;(6)) = e;(d) for all j. This implies f|d by the Theorem 
on Factorization of Divisors. We have now proved that d = gcd(a, b). 


min(e; (a) 


We can extend the definitions of gcd and min to apply to k inputs instead of two inputs 
(see Exercise 14 of §4.6). The proof above extends to show 
gcd(a, EN ak) = g nes RD) 
j21 


A similar analysis is possible for the least common multiple (lcm) of two or more integers. 
By definition, lem(a,,...,a,) is the least positive integer d such that every a; divides d (we 
assume every a; is nonzero here). The proof for gcds can be adapted to show 


max(e; (a1),.--,€; (@k)) 


Iem(a1,..., ak) = j 


j21 


where max(21,..., 2%) is the largest of the real numbers 21,..., Zk. 
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Prime Factorization of Rational Numbers 


The prime factorization theorem for nonzero integers can be extended to a prime factoriza- 
tion theorem for nonzero rational numbers by allowing negative exponents. We continue to 
assume that pı < p2 < ++- < pj <+- is the list of all prime integers. 


4.61. Theorem on Unique Prime Factorization of Rational Numbers. For every 
nonzero q € Q, there exist unique integers s(q) € {+1,—1} and e;(q) for j € Z>1, such that 
all but finitely many e;(q) are zero, and 


a= sla) [J 75". 


j21 


Proof. Fix nonzero q € Q. First we prove existence of the claimed factorization. Since q 
is rational, we know there are integers n and m such that m #4 0 and q = n/m. Note 
n Æ 0, since q 4 0. By the Theorem on Prime Factorization of Integers, we know n = 


ej(m) 


s(n) [se and m = s(m) [131 2; . Dividing n by m, 


a =n/m = [s(n)/s(m)] T] vy. 


j21 


Choosing s(q) = s(n)/s(m), which is in {+1,—1}, and (for each j) choosing e;(q) = e;(n) — 
e;(m), which is in Z since e;(n),e;(m) € Z>o0, we have found a factorization of q of the 
required type. 

Proving uniqueness of the factorization is a little tricky, since n and m are not uniquely 


determined by g. Assume 
q=8 [[27 = 5 II p7 (4.10) 


j21 j21 


where the integers s and ej as well as the integers s’ and ej satisfy all the conditions in the 
theorem statement. We must prove s = s’ and ej = ej for all j € Z>1. If q is positive, we 
must have s = +1 = s’; while if q is negative, we must have s = —1 = s’. To deal with the 
exponents, we divide the index set Zs into three subsets. Let A = {j € Zs) : €j = eh let 
B = {j € Z>1 : ej < ej}, and let C = {j € Zp1 : ej > e}. Our goal is to prove B = C = 0. 
To begin, we can divide both expressions for q in (4.10) by the common factor s |] (<4 p3, 


which leaves ; ; 
Her He = [e7 L27. 


jEB jee jEB  jEeC 
Here and below, we interpret a product over an empty set of indices as 1.) Next, divide 
, ° y 
both sides by Ilen 2; Leen to get 


Ile? = i. (4.11) 


jEC jEB 


By the definitions of B and C, the left side is either 1 or a product of strictly positive powers 
of primes p; with j € C; and the right side is either 1 or a product of strictly positive powers 
of primes p; with j € B. Consider various cases that might occur. 

Case 1. Assume B # Ø and C # Q. Note that the primes p; for j € B are all different from the 
primes pj for j € C, since B and C cannot overlap. Now (4.11) displays two different prime 
factorizations of a positive integer, violating uniqueness of prime factorization of integers. 
Case 2. Assume B = and C # Ø. Then the right side of (4.11) is 1, whereas the left side 
is larger than 1, which is impossible. 


200 An Introduction to Mathematical Proofs 


Case 3. Assume B 4 Ú and C = Ø. Then the left side of (4.11) is 1, whereas the right side 
is larger than 1, which is impossible. 

Case 4. Assume B = § and C = 9. This is the only possible case, and in this case we must 
have ej = ej for all j E€ Z> 1, as needed. This completes the uniqueness proof. 


Proving Irrationality of nth Roots 


In §2.4, we proved that v2 is irrational using proof by contradiction. Now that we have 
the unique factorization theorem for rational numbers, we can quickly prove a much more 
general result that tells us exactly which numbers /q (where q E€ Q>o and n € Zs}) are 
rational. In using the notation ~/q, we are tacitly assuming the following fact about real 
numbers: for every positive real number x and every positive integer n, there exists a unique 
positive real number w such that w” = x; by definition, ~/z is the unique such w solving 
w” = x. This can be proved using the Intermediate Value Theorem from calculus; the case 
of square roots (n = 2) is proved in Theorem 8.61. Informally, our main result says that the 
only positive rational numbers q with rational nth roots are those for which the exponents 
of all primes in the factorization of g are multiples of n. 


4.62. Theorem on Rationality of Roots. For all q E€ Q>o and all n € Zsi, Yq is 
rational iff for all 7 € Z>1, n divides e;(q); and in this case, 


yq = E 


j21 


Proof. Fix q E€ Qso and n € Zs. By the fact quoted above, let w € Ro be the unique 
real number such that w” = q. Part 1. Assume w € Q, and prove n divides every e;(q). 
Since w is positive and rational, it has a prime factorization w = |] j>1 Py (w) Raising this 
expression to the power n, 


po =w=q= jae: 
j21 j21 


Since the prime factorization of q is unique, we conclude that e;(q) = ne;(w) for every 
j = 1. Since e;(w) € Z, we see that n divides every e;(q). Furthermore, e;(w) = e;(q)/n, 
so the last formula in the theorem statement holds. 

Part 2. Assume n divides every e;(q); prove w € Q. Let wi = I> py ” which is a well- 
defined positive rational number since every e;(q)/n is an integer. By direct computation, 


(a)/ 


we = I;> Py (a) _ q = w”. By the fact quoted above, w is the unique positive real number 


whose nth power is q, so w = w, € Q, as needed. 

4.63. Example. The rational number q = 0.729 = 278365- has rational cube root 3/q@ = 
0.9 = 2~13°5~1. But \/q is not rational since e1 (q) = —3 is not divisible by 2. More generally, 
for all n € Zs — {3}, %4 is not rational. 


Similarly, it follows from the theorem that the only positive integers k such that Vk is 
rational are integers of the form k = j? for some positive integer j; the only k € Zso such 
that Yk € Q are those of the form k = j’ for some j € Zs; and so on. In particular, for 
every prime integer p, the numbers ,/p, ẹ/p, etc., are all irrational. 


Section Summary 


Let pı < p2 < +- < pj <+- be the list of all prime integers. 


Integers 


1. 


Computing with Prime Factorizations. Given positive integers n and m with prime 


factorizations n = UEN i and m = lar we have: 


nm = eee n divides m iff e;(n) < e;(m) for all j > 1; 
j21 


max(ej (n),e;(m)) 


min(ej (n),e;(m)) , 
? J l 


j and lcm(n, m) = 
j21 j21 


gcd(n,m) = 


Similar formulas hold for the product, gcd, and lcm of a finite list of nonzero 
integers. 


Prime Factorization of Rational Numbers. For every nonzero rational number q, 
there exist unique s € {+1,—1} and unique integers e;(q) for j > 1, such that all 
but finitely many e;(q) are zero, and q = s(q) Iar i 

nth Roots of Rational Numbers. For any positive rational q = Tiere” and 
n € Zs, %/q is rational iff n divides e;(q) for all j > 1. 


Exercises 


1. 


ene eo aoe 


12. 


13. 


Let n = 253374, m = 23355271, and s = 345472. 

a) Find gcd(n,m), gcd(m, s), ged(n, s), and gcd(n, m, s). 
b) Find lem(n, m), lem(m, s), lem(n, s), and lem(n, m, s). 
c) Are any of yn, ym, or y's rational? 

Find the prime factorizations of each rational number. 

a) 0.625 (b) 212/121 (c) 2.64 (d) 3300/77077. 


a) Prove the Lemma on Minimums. (b) State and prove an analogous result for 
max(zx, y). 


Suppose 21,...,@, are fixed real numbers. (a) Give a recursive definition of 
min(#1,...,%%) and max(#1,...,2%). (b) Generalize the results of the previous 
exercise to apply to this situation. 

ak) as Il prin(es (a1)s-69 (4K) 


Given integers a1,...,a% > 0, prove gcd(ay,.. joi Pj 


Given integers a,,...,a, > 0, prove lem(a,,...,a,) = Lao AN maa? eg (ae) 


Extend Theorem 4.56 to the case where n and m are nonzero rational numbers. 


Given nonzero rational numbers q and r, find and prove a formula for the prime 
factorization of q/r. 

Use prime factorizations to prove: for all r, s,t € Zso, if gcd(r,t) = 1 = gcd(s, t), 
then ged(rs,t) = 1. 

Use prime factorizations to prove: for all a,b,c € Zso, if gcd(a,b) = 1 and a 
divides bc, then a divides c. 

(a) Prove or disprove: for all m,n € Z>o, if m > n, then e;(m) > e;(n) for all 
j = 1. (b) Prove or disprove: for all m,n € Zyo, if e;(m) > e;(n) for all j > 1 
then m >n. 

Prove: for all nonzero a,b € Q, there exists n € Z with b = na iff for all j € Z>1, 
ez(a) < e;(b). 

Prove: for all positive integers a,b,c, gcd(ca, cb) = c- gcd(a, b) using prime fac- 
torizations. (See Exercise 14 in §4.5 for a different proof.) 
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14. 


15. 


16. 


17. 


18. 


19. 
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Prove: for all positive integers a,b,d, gcd(a,b) = d iff d|a and d|b and 
gcd(a/d, b/d) = 1. 

Suppose we define divisibility in Q as follows: for all q,r € Q, q divides r iff 
Js € Q,r = sq. Characterize all pairs (q,r) such that q divides r under this 
definition. 


(a) Prove: for all x,y € R, z +y = min(z, y) + max(x, y). (b) Deduce that for all 
a,b € Zzo, ab = gcd(a, b) - lem(a, b). 


Prove: for all m,n € Z>o, if there exists q € Q with q” = m, then there exists 
k € Z with k” = m. [Thus if an integer has a rational nth root, then the root 
must actually be an integer.| 


Given a nonzero q € Q, let P(q) be the set of primes p; such that e;(q) 4 0. 
(a) Prove: for all nonzero q € Q, P(—q) = P(q). (b) Prove: for all nonzero 
q,r € Q, P(gr) C P(q) U P(r). Show that equality holds if P(g) N P(r) = 0. 


(a) Given nonzero rational numbers q,...,q%, define rational versions of 
gcd(qi,.--,q¢) and lem(qi,...,q). (b) For nonzero q,r € Q, can we always write 
gcd(q,r) = qx + ry for some integers x and y? 


Review of Set Theory and Integers 


Tables 4.1—4.6 on the following pages review the main definitions, proof templates, and 
theorems covered in the preceding chapters on set theory and integers. All unquantified 
variables in the tables represent arbitrary objects, sets, real numbers, or integers (as required 
by context). 


Set Theory Pitfalls 
(a) Order and repetition do not matter in sets. For example, {1,2} = {2,1} = {1,1, 2, 2}. 


(b) Distinguishing € and C. A € B means the object A is a member of the set B. AC B 
means every member of the set A is a member of the set B. Do not confuse the set mem- 
bership relation (A € B) with the subset relation (A C B). Note that sets are allowed to 
be members of other sets. 


(c) Types of Brackets. There are three kinds of brackets used in set theory: round 
parentheses (...), square brackets [---], and curly braces {...}. Each shape of bracket 
has a different meaning: use round parentheses for ordered pairs or open intervals, use 
square brackets for closed intervals, and use curly braces for sets. Adding more braces 
changes the meaning; for example, Ø and {Ø} and {{0}} are all different sets. Similarly, 
S = {1, {2,3, 4}, 5,6, {{7}, 8}, [3, 5]} is a set with six members: the integers 1, 5, and 6; the 
set containing 2 and 3 and 4; another set with members 8 and {7}; and the closed interval 
[3,5]. 


Theorems on Divisibility, Primes, and GCDs 


1. Integer Division Theorem. For all integers a and b with b Æ 0, there exist unique 
integers q,r with a = bq +r and 0 < r < |b|. We call q the quotient and r the 
remainder when a is divided by b. 


2. Theorem on Computing GCDs. For all a € Z, gcd(a, 0) = |a|. For all a,b, q,r € Z, 
if a = bq +r then gcd(a, b) = gcd(b, r). 

3. Linear Combination Property of GCDs. For all a,b € Z, there exist x,y € Z 
such that gcd(a, b) = ax + by. This result extends to the gcd of any finite list of 
integers. 


4. Euclid’s Recursive GCD Algorithm. 
Input: a,b € Zo. 
Output: d = gcd(a, b), and x,y € Z with d = az + by. 
Procedure: 1. If b = 0, return d = a, x = 1, y = 0. 
2. If b > 0, divide a by b to get a = bq +r with0 <r <b. 
3. Recursively compute d’ = gcd(b,r) and s,t € Z with d' = bs + rt. 
4. Return d = d', x = t, and y = s — qt. 
The iterative version of this algorithm uses repeated division to find gcd(a, b) (the 
last nonzero remainder), then works backwards through the divisions to find x 
and y. 
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TABLE 4.1 
Set Definitions. 
Concept Defined Term Definition Text 
Set Membership x € A [undefined term] (informal) x is a member of A. 
cEéA ~(a € A). 
Subset ACB Va,nEASxreB. 
AZB Jr, €E AAT # B. 
Proper Subset ACB A È B and AFB. 
Set Equality A=B Va,rEAsrxeB 


(equivalently: A C B and B C A). 


Union zre AUB cxeEAorreB. 
Intersection xEANB xE AandzEB. 
Set Difference zeA-B xE Aandrzg B. 
Empty Set red x € Í is false for all x. 
B=0 Va,a ¢ B. 
B Æ 0 da,x € B. 
Singleton Set z € {a} z=a. 
Unordered Pair z € {a,b} z=aorz=b. 
Unordered Triple z € {a,b,c} z=aorz=borz=c. 
Ordered Pair (a,b) [undefined term] Axiom: Va, Vb, Vc, Vd, 


(a,b) = (c,d) & (a=cAb=d). 
for, define (a,b) = ta), {a, b}}] 


Open Interval x € (a,b) xe€Randa<«x<b 

Closed Interval x € [a,b] xe€Randa<a< b. 

Half-Open Interval x € [a,b) xe€Randa<a<ob. 

Half-Open Interval <æ € (a, 0] cé€Randa<a<b. 

Power Set SEP(X) SCX. 

Product Set zExxY Jx, Jy, (x E XAyEY)Az= (z,y). 
(a,b) EXxXY a€ X andbeY. 

Indexed Union z € Ucr 4i Ji € I,z € Aj. 

Indexed Intersection z € [fjer Ai Vi € I,z € A; (need I #9). 


Review of Set Theory and Integers 205 


TABLE 4.2 

Proof Templates Involving Sets and Induction. 
Statement Proof Template to Prove Statement 
ACB Fix an arbitrary object xo. 


Assume zo € A. 
Prove xo € B. 
[To continue, expand definitions of “ag € A” and “ao € B.”| 
SxTCC Fix an arbitrary ordered pair (a,b). 
Assume (a,b) € S xT, 
which means a € S and b E T. 
Prove (a,b) € C 
[by expanding the definitions of membership in S, T, and C]. 


A=B Part 1. Prove AC B. 
(set equality) Part 2. Prove B C A. 
A=B Fix an arbitrary object £o. 


(set equality) Prove zo € A © zo € B by a chain proof. 
[If A and B contain only ordered pairs, replace zo by (x, y).] 
A=0 Assume, to get a contradiction, that A Æ f. 
We have assumed Jz, z € A. 
Expand the definition of “z € A,” 
and try to find a contradiction. 
Yn € Z>, P(n) We use (ordinary) induction on n. 
7 1. Base Case. Prove P(b). 
2a. Induction Step. Fix an arbitrary integer n € Z>p. 
2b. Induction Hypothesis. Assume P(n). 
2c. Prove P(n +1). 
[Use P(n) to help prove P(n + 1).] 
Yn € Z>, P(n) We use strong induction on n. 
1. Fix an arbitrary integer n € Z>». 
2a. Strong Induction Hypothesis. Assume that for all 
integers m in the range b < m < n, P(m) is true. 
2b. Prove P(n). 
(Small values of n may require separate cases. Use P(m), 
for well-chosen values of m < n, to help prove P(n).] 
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TABLE 4.3 

Properties of Subsets. 
Reflexivity 
Antisymmetry 
Transitivity 
Lower Bound 


Greatest Lower Bound 


Upper Bound 


Least Upper Bound 


Least Element 


Difference Property 


Monotonicity of N 
Monotonicity of U 
Monotonicity of — 


Inclusion Reversal of — 
Subset Characterizations 


TABLE 4.4 

Set Equality Theorems. 
Reflexivity 
Symmetry 
Transitivity 
Commutativity 
Associativity 
Distributive Law 1 
Distributive Law 2 
Idempotent Laws 
Absorption Laws 
Properties of Ø 


De Morgan Law 1 
De Morgan Law 2 
Set Decomposition 
Difference Properties 
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DD 
IN Il 
Qu 


Aand ANBCB. 
Biff (C C A and C CB). 
B and BC AUB. 
C if (AC C and BCC). 


Cae he 


AUB=Bs A-B=O. 


DP PPP mS mR QR OD 
INININIAIN FIA Cinin D 
wnawyndst Band 


ge bd plo 


C-A 
As 


D 
D 


A=A. 

A=B=>B=A. 

(A=BAB=C)SA=C. 

AUB=BUA, ANB=BNQNA. 
(ANB)NC=AN(BNC), (AUB)UC=AU(BUC). 
AN(BUC) =(ANB)U(ANC). 
AU(BNC)=(AUB)N(AUC). 

ANA=A, AUA=A. 


AU(ANB)=A, AN(AUB)H=A. 
AU@=A, ANO=0, Axd=0, OxA=F, 
A-@=A, 09-A=0, A-A=@O. 
A—(BUC) =(A-B)n(A-C) 

A—(BNC) =(A-B)uU(A-C). 
A=(A-B)U(ANB), (A—B)N(ANB)=6 
ASAI BY ASB LAB) Aes 
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TABLE 4.5 
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Properties of Power Sets, Product Sets, and General Unions and Intersections. 


Power Sets: 
Min/Max Subsets 
Monotonicity 
Intersection Property 


fe P(A), AE P(A). 


xe 
3 


Union Property P(A)UP(B) CP(AUB). 

Difference Property P(A- B)- {0} C P(A) — P(B) 

Product Sets: 

Monotonicity (ACCABCD)3SAxBCCxD 

Distributive Laws (AUB)xC=(AxC)U(Bx OC), 
(AN B)xC=(AxC)N(Bx C), 
(A-B) x C= (Ax C)—(BxC), 
similarly if C appears in the first factor. 


Intersection Property 
Union Property 


(Ax B)N(Cx D 


)=(ANC) x (BND). 
(Ax B)U(Cx D)C 


AUC) x (BUD). 


—_— 


Unions and Intersections: 


Monotonicity of | 
Monotonicity of (1) 
De Morgan Law 1 
De Morgan Law 2 
Distributive Laws 


Lower and Upper Bound 
Greatest Lower Bound 
Least Upper Bound 
Enlarging the Index Set 


Combining Index Sets 


(Vi E I, Ai G Bi) = User Ai (E Wier Bi- 
(vi € I, A; C Bi) > Nier Ai E icr Bi- 
N Vier A; = Mier X S Aj). 
X- Mier A; = Uier(X = Aj). 
XO Uier 4i = User (XM Ai); 
XU Mier A, = Nicer X U Ai), 
Xx Uier B; = Uier(X x Bj), 
X x Mier Bi = Nicr(X x Bi), 


similarly if X appears as the second factor. 
Vk € I, Mier 4i E Ark C Ucr 4i 

XE erd iff (Vi € I, KE AD. 

User Ai C X iff (Vi € I, Ay C X). 

FOF => User Ai cE Wend; 

ICJ=> Ney A jS Ther Ai 

(Vier Aj) U (Ues Aj) E Ukrerus Ak, 

(Mier Ai) N (Mjes Aj) = Nrerus Ak 
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TABLE 4.6 

Definitions Involving Recursion or Divisibility. 

Concept Defined Term Definition Text 

Sum X k= Tk Ekat Tk = th, 


(for fixed b < n) Erti er = (Xk p Tk) + Enpi- 


Product Iip £k I- £k = 20, 

(for fixed b < n) ise £r = (leg te) this 
Factorials n! (n € Zso) 0!=1, (n+1)!=(n4+1)-nl. 
Exponents x” (x E€ R,n € Z>0o) x° =1, °t! = gr”. x. 
Fibonacci sequence Fn (n € Zso) Fo =0, F, = 1, 


Fn = Fn_-1 + Fn—o for all n € Z>2. 


Divisor a|b (a divides b) du € Z,b = au. 
Common Divisor d|a and d|b ds,t € Z,a = ds and b = dt. 
Greatest Common Divisor d = gcd(a, b) d|a \ d|b A Yc € Z>o, (cla A clb > c < d). 
Least Common Multiple £ = lcm(a, b) all A ble A Ve € Zso, (ale A ble > £ < e). 
Composite n is composite n € Zs, and Ja, (1 <a < n^ ajn). 
Prime n is prime n € Zs, and ~Ja, (1 <a < n^ajn). 
Rational rEQ dm € Z,An€ Z,n40A2=m/n. 

5. Theorem on Divisibility by Primes. For all primes p and all a1,...,ax € Z, if p 


divides a,a2---axz, then p divides some a;. 


Infinitude of Primes. For every integer n, there exists a prime p > n. So there 
are infinitely many primes. 


Unique Prime Factorization of Integers. Let pı < po < +++ < pj <» be the 
list of all prime integers. For all nonzero n € Z, there exist unique integers 
s(n) € {+1,—1} and e;(n) > 0 (for j = 1,2,3,...) such that all but finitely 
many e;(n) are zero, and n = s(n) J In words, every nonzero integer n 
j>1 

can be written as a product of zero ot more primes, times a sign factor. The sign 
and primes occurring in the factorization (including the number of times a given 
prime appears) are uniquely determined by n. 

Unique Prime Factorization of Rational Numbers. Let pı < po < -+> < pj <= 
be the list of all prime integers. For all nonzero q € Q, there exist unique integers 
s(q) € {+1, -1} and e;(q) € Z (for j = 1,2,3,...) such that all but finitely many 
e;(q) are zero, and q = s(q) [27 (@) 

j21 

Divisibility Concepts and Prime Factorizations. For all nonzero x,y € Q and all 
j € Z>1, s(xy) = s(x)s(y) and e; (xy) = e; (a) + e;(y). For all nonzero a,b € Z, a 


divides b iff for all j € Z>1, e;(a) < e;(b). For all nonzero a1,..., ax € Z, 
ecd(ay,...,a%) = io Se ae 


j21 


lcem(a1,..., ak) = anpa Six ez (an) 


j21 
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For all q E€ Qso and n € Z>1, 4/4 is rational iff for all j € Z>1, n divides e;(q); 
and in this case, 4/4 = [[;>; gc 
10. Summation Formulas. (These can be proved by induction.) 
Lr L=n—-bt+l. 
Li k = n(n + 1)/2. 
Tia k? = n(n + 1)(2n + 1)/6. 
ka K = n? (n + 1)?/4. 
reo” = (r?*t — 1)/(r — 1) for all real r 4 1 
Dpp (@e + Yk) = X pp Te t} pp Yk for all £k, yx € R and all n > b. 
pp (ate) = a X pp Tk for all a, £p € R and all n > b. 


Review Problems 


1. Complete the following definitions and theorem statement. 
(are AUB iff ... 
(b) GE Nex A; iff ... 
(c) (y,z) EC x D iff... 
(d) Integer Division Theorem: ... 


2. Use Euclid’s algorithm to compute gcd(210,51), and find integers x and y such 
that gcd(210, 51) = 210z + 5ly. 


3. Prove this statement by ordinary induction: for all n € Zs2, X pv zT D= = nl 


4. Let A, B, C be fixed sets. Prove using only the definitions: 
f BAQACC andB-—ACC,then BCC. 


5. Define a sequence recursively by setting ag = 8, ay = 13, and an = 7an_—1 —6an_2 
for all integers n > 2. Prove by strong induction: for all n E€ Z>0, an = 6” +7. 


6. (a) Let A be a fixed set. Which of the following sets must equal the empty set? 
AUD, ANO, A—0, Ax 0, P(O), {0}. 
b) Disprove: for all sets B, BN P(B) = 90. 


7. True or false? Explain briefly. 

a) {3,5,8} = {5,8,3}. 

b) {1,2,2, 4,4,4,4} = {1,2,4}. 

) {2} © {H2}. 

) {7,8} € {{7, 8}, 9}. 

) {7,8} © {{7, 8}, 9}. 

) For all sets A and B, AU B = BUA. 
) For all sets A and B, Ax B=B x A. 
) For all sets A and B, Ax BÆB xA. 
i) For all sets A, Ø € A. 

j) For all sets A, Ø C A. 


8. For every positive integer n, define a set An = [1/n,n + 2] U {—n}. 
Draw pictures of each set (on a number line or in R?, as appropriate). 
co 


(a) Ay x A2 (b) U An (c) Ag — Ay (d) A2 x A3 


c 
d 
e 
f 
8 
h 


9. Use Euclid’s Algorithm to compute gcd(528, 209), and find integers x and y such 
that gcd(528, 209) = 528x + 209y. 
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10. 


11. 


12. 


13. 


14. 


15. 
16. 


17. 
18. 


19. 
20. 
21. 


22. 


23. 
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Prove using only the definitions: for all sets A, B, C, 


Ax(BUC)=(AxB)uU(AxC). 


Define a sequence recursively by ag = 0 and a@n41 = 3an + 2 for all n € Z>o. 
Compute a1, a2, a3, and a4, guess a closed (non-recursive) formula for an, and 
then prove your guess by ordinary induction. 

Prove using only the definitions: for all sets A and B, A — (A — B) = AQ B. 
Write just the outline of a proof by strong induction of the following statement: 
“Every odd integer n > 7 can be written as the sum of three primes.” 

For each n € Z>1, define a set A, = [1/n,n+3)—{n}. Draw each of the following 
sets on a number line. (a) UZ; An (b) NZ; An (c) (c)A2 — Ai (d) Ae x As. 
Prove: for all sets A, B, C, (A — B) x C = (A x C) — (B x ©). 

Prove: for all n € Z, the following statements are equivalent: 4 divides n + 2; 4 
divides n — 2; n is even and n/2 is odd; 2 divides n but 4 does not divide n. 
Prove: for all sets A, B, C, if ANC = BANC and AUC = BUC, then A= B. 
Let I and J be fixed, nonempty index sets. Let A; (for each i € I) and A; (for 
each j € J) be fixed sets. Give two proofs of this fact: (Q,<; Ai) N (ies 45) = 


Mkeruz 4k- In the first proof, use only the definitions. In the second proof, try 
to use previously known theorems wherever possible (except the result being 
proved). 


Compute d = gcd(1702, 483), and find integers x,y with d = 1702x + 483y. 
Prove: Va € R,Wn € Z>0, opp 45" = a(5” t! — 1)/4. 


Define cı = cg = 1, and cn = 2cn-1 +3cn-2 for all n > 3. (a) Prove that cn > 3"~? 
for all n > 3. (b) Prove that cn < 2-3"~? for all n > 3. 


(a) Prove: for all sets A and B, P(A — B) — {0} C P(A) — P(B). 
(b) Disprove: for all sets A and B, P(A — B) —@ C P(A) — P(B). 


Recursively define a sequence of sets T, as follows. Define Ty = Ø. For all integers 
n > 0, define T,41 = Tn U {Tn}. Prove: 


Vn E€ Z>0, Yx, (x E€ Tn >£ C Ta). 
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Relations and Functions 


5.1 Relations 


In this chapter, we study relations and functions, which are two of the most central ideas 
in mathematics. Informally, a function f is a rule that takes each object x in some set of 
inputs and transforms it into a new object y = f(x) in some set of outputs. A key point is 
that for every allowable input x, there must exist exactly one output y assigned to this x by 
the function. One of our main goals is to translate this informal description into a formal 
definition within set theory. To do so, we first define and study relations, which can be 
regarded as generalizations of functions in which each input x can be associated with more 
than one output, or perhaps no outputs at all. Many important mathematical concepts — 
including functions, partial orders, and equivalence relations — can be developed within 
the unifying general framework of relation theory. 


Relations 


In set theory, a relation is any set of ordered pairs. If R is a relation and (x,y) € R, we can 
think of xz as an input that is associated with the output y. In this setting, it is important 
to keep track of which objects might appear as potential inputs and outputs. This leads to 
the following definition. 


5.1. Definition: Relations. For all sets R, X, Y: 
Ris a relation from X to Y | iff} RC Xx Y |. 


This means that every member of R has the form (x,y), for some xz € X and some y € Y. 


5.2. Example: Arrow Diagram of a Relation. Let X = {1,2,3,4,5,6} and Y = 
{a,,c,d,e, f}. Then R = {(1,c),(1,e), (1, F), (3, F), (4, a), (4, b), (4, F), (6, ¢)} is a relation 
from X to Y. We can visualize the relation R by the following arrow diagram. 
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The arrow diagram consists of an oval with members of X (potential inputs for R) on the 
left side and an oval with members of Y (potential outputs for R) on the right side. For all 
x € X and y € Y, we draw an arrow pointing from z to y iff (x,y) € R. When X and Y 
are finite sets, this arrow diagram contains exactly the same information as the set R. 

In the current example, we see that the input 1 is related (under R) to the outputs c and 
e and f. Input 3 is related to output f only. Inputs 2 and 5 are not related to any outputs 
under R. Similarly, d is a possible output that does not get associated with any input by the 
relation R. In particular, R is also a relation from {1,3,4,6} to {a,b,c,e, f}, or a relation 
from Z to {a,b,c,...,z}, and so on. However, R is not a relation from X’ = {1,2,3,4} to 
Y since (6,c) € R but (6,c) g X' xY. 


5.3. Example: Graph of a Relation. If X and Y are subsets of R, we can draw the 
graph of a relation S from X to Y by drawing a dot at each point (x,y) in the plane R? 
such that (x,y) € S. For example, suppose X = [—3,3], Y = [—2,2], and S is defined by 
letting (x,y) € S iff x € R and y € R and (x?/4) +y? = 1. The graph of the relation S is 
the ellipse shown below. 


Since this ellipse is completely contained in the solid rectangle X x Y (whose boundary 
is indicated by dashed lines in the figure), S is a relation from X = [—3,3] to Y = [—2, 2]. 
Since (—2,0) € S, the input « = —2 has associated output y = 0. The input 0 is related to 
outputs 1 and —1. The input 3 € X is not related to any output, and the possible output 
2 € Y is not related to any input. S is also a relation from R to R, and S is a relation from 
[—2, 2] to [-1, 1]. But S is not a relation from [0,1] to [0,1], because the graph of S does 
not lie completely within the square [0,1] x [0, 1]. 


5.4. Example. A relation can be any set of ordered pairs, not necessarily a set determined 
by an explicit equation involving x and y. For example, the next figure shows the graph of 
a relation T from R to R. 
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5.5. Example. Suppose X and Y are any sets. Is Ø a relation from X to Y? Yes, since 
0C X xY by an earlier theorem. When we are thinking of @ as a relation, we could call 
it the empty relation. For finite sets X and Y, the arrow diagram of Ø contains no arrows. 
When X and Y are subsets of R, the graph of @ has nothing in it. 

At the other extreme, the full product set X x Y is a relation from X to Y, since 
XxY C X xY by an earlier theorem. In this relation, every possible input x € X is related 
to every possible output y € Y. The arrow diagram for X x Y contains every possible arrow 
leading from a point in X to a point in Y. When X and Y are closed intervals, the graph 
of X x Y (as we have seen before) is a solid rectangle of points with x-coordinates coming 
from X and y-coordinates coming from Y. 


Images of Sets under Relations 


Suppose R is a relation from X to Y. We have seen that each individual member x of X is 
related by R to zero or more members y of Y, namely those y for which (x,y) € R. More 
generally, if we are given a set C of possible inputs to R, we could examine the ordered 
pairs in R to find the set of all possible outputs associated with the inputs coming from C. 
This new set is called the image of C under the relation R and is denoted R[C]. Formally, 
we make the following definition. 


5.6. Definition: Image of a Set under a Relation. For all relations R, all sets C, and 
all objects y, |y € R[C] | iff} da € C, (x,y) € RI. 


Note that this definition places no restriction on the set C, although we usually take 
C to be a subset of the input space X. To get used to this definition, let us see how to 
compute images of sets for relations given by arrow diagrams and graphs. 


5.7. Example. Consider the relation 


R= {(1,¢), (1,e), (1, f), (3, f), (4, a), (4, b), (4, f), (6,c)} 


from Example 5.2. Given C = {1,3,5}, we compute R|C] = R[{1,3,5}] by scanning the 
ordered pairs in R and listing the second component of each ordered pair whose first compo- 
nent is in C. This produces the set R[C] = {c,e, f, f} = {c,e, f}. Note that R[C] = R[{1, 3}] 
since the input 5 is not related to any output. In fact, R[C] = R[{1}] since the output f 
(related to input 3) is already related to input 1. Image computations can be performed 
conveniently using the arrow diagram of R (shown on page 211). For example, to com- 
pute R[{2,4,6}] from the arrow diagram, start at the dots 2 and 4 and 6 on the left 
side and follow all possible outgoing arrows. The set of outputs that we reach, namely 
{a, b, f,c} = {a, b,c, f}, is the image R[{2, 4, 6}]. The following generic arrow diagram shows 
how R[C] is obtained from C. 


X 
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If we think of the arrows as rays of light leading from C to a screen represented by the 
set Y, then R[C] is the image cast by the object C on this screen. This picture is the source 
of the term image. 

5.8. Example. Define a relation S from R to R by letting S consist of all (x,y) € R? 
with x = y?. In the following pictures, we use the graph of S to compute $[{4}], S[(1,4)], 
S[Rso], and S{[—1, 1/4]]. 


First, S[{4}] = {2,—2} since (4,2) € S and (4,—2) € S and no output other than 2 or 
—2 is related to the input 4 (as 4 = y? is solved only by 2 and —2). To obtain this answer 
from the graph of S, we move up and down from the input 4 on the z-axis until we hit the 
graph of S. We hit the graph twice, at heights y = 2 and y = —2, so the image is {2, —2}. 

We compute the image of the open interval (1,4) similarly, but now we imagine moving 
up and down from all points on the z-axis in the range 1 < x < 4. We hit two arcs of 
the graph, which occupy heights between 1 and 2 (exclusive) and between —2 and —1 
(exclusive). In interval notation, we have S[(1,4)] = (—2, —1) U (1, 2). 


Next, if we allow all strictly positive real numbers as inputs to S, we see from the graph 
(or the formula x = y?) that we obtain all nonzero real numbers as the related outputs. 
Thus, S[R>0] = (—00,0)U (0,00) = Ryo. Starting with the closed interval [—1, 1/4], we find 
S[[-1, 1/4]] = [—1/2, 1/2]. On the other hand, $[[—1, —1/4]] = 0. 

5.9. Example. Consider the relation T whose graph was drawn in Example 5.4. Use the 
graph to find (approximately) THO}, T[{1.5}], and T[R<o]. 

Solution. To find T[{0}], draw a vertical line through the graph of T at x = 0, and look at 
the heights of the points on the graph that meet this line. We estimate T[{0}] = [—2, —4/3]U 
[—1/4, 1/2]. Similarly, we estimate 


TH1.5} = [-1.5, —1.1] U {9/8, 15/8, 17/8}. 
Finally, T[R<o] = (—2, —1] U [-1/4, 1/2) U [9/8, 15/8] U [2, 9/4]. 
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Properties of Images 


The next theorem lists properties of the image construction for relations. 
5.10. Theorem on Images. For all sets X, Y, A, B, I 40, A; (for i € I) and all relations 
Rand S: 
(a) Monotonicity: If A C B, then RJA] C R[B]. 
(b) Empty Set Images: R|Ø] = 0 and O[A] = 0. 
(c) Image of Unions: R[AU B] = R|A] U R[B] and R[U,<; Ail = Uje, RAI 
and (RU S)[A] = R[A] U SJA]. 
(d) Image of Intersections: R[AN B] C R[A]N R[B] and R[N;er Ail © Nier RA: 
and (RN S)[A] C R[A]N SJA]. (Equality is not always true.) 
(e) Image of Set Difference: R|A] — R|B] € R[A — B]. (Equality is not always true.) 
(£) Input/Output Property: If R is a relation from X to Y, then RJA] CY 
and R[A] = R[AN X]. 


We prove some representative parts of this theorem, asking you to prove other parts 
in the exercises. Fix arbitrary relations R and S, and sets A, B, etc., as in the theorem 
statement. 


5.11. Proof of Monotonicity. Assume A C B. Prove R[A] C R[B]. [Continue with the 
subset template.] Fix an arbitrary object y; assume y € R[A]; prove y € RIB]. We have 
assumed there exists x € A with (x,y) € R. We must prove 3z € B,(z,y) € R. Choose 
z = x, so that (z,y) = (x,y) € R. Since x € A and AC B, we have z € B, as needed. 


5.12. Proof that R[@] = Ø. Assume, to get a contradiction, that RØ] 4 0. This assumption 
means there exists an object y € R|Ø]. Expanding the definition of image, we have assumed 
Jx € Ø, (x,y) € R. But we know x ¢ Ø. The contradiction “x € Ó and x ¢ @” proves 
R|] = 9. 


5.13. Proof that R[U;<; Ai] = Uje, R[Ai]. [We try a chain proof.] Fix an arbitrary object 
y. We know 


yER U Ail e Jre U Ai, (x,y) E R (by definition of image) 
icI iel 
> dr, (: E€ U A; A (z,y) € n) (by quantifier conversion rule) 
iel 
& Jr, (Ji € I,x € Aj) A (x,y) € R) (by definition of union) 
s Jr, Ji € I, (x € A; A (x,y) € R) (by a quantifier property) 
s Ji c€ I, Jx, (x € A; A (x,y) € R) (by reordering 3 quantifiers) 
s Jie I, Jx € A; (x,y) ER (by quantifier conversion rule) 
s Ji € I,y € R[A]] (by definition of image) 
Sye U R[Aj] (by definition of union). 
iel 


5.14. Proof of Input/Output Property. Assume R is a relation from X to Y, which 
means RCX xY. 
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Part 1. Prove R[A] C Y. Fix an object z; assume z € R[A]; prove z € Y. We have assumed 
da € A, (x, z) E€ R. Since RC X x Y, we know (a, z) € X x Y. Thus x € X and z € Y, so 
zEY. 

Part 2. Prove R[A] = R[AN X]. Part 2a. Prove R[A] C R|AN X]. Fix z; assume z € RJA]; 
prove z € R|AN X]. We assumed Ja € A, (x,z) € R. We must prove Jw € AN X, (w, z) € R. 
Choose w = x, so (w, z) = (x,z) € R. We must check that w € AN X, which means w € A 
and w € X. On one hand, since w = x and x € A, w € A. On the other hand, since 
(w,z) € Rand RC X xY, we deduce (w,z) € X x Y and hence w € X. Part 2b. Prove 
R[AN X] C RJA]. Since AN X C A by an earlier theorem, we know R[A N X] C R[A] by 
the previously proved monotonicity property of images. 


Section Summary 


1. Relations. R is a relation from X to Y iff RC X xY. To make the arrow diagram 
of a relation R, draw an arrow from the input x € X to the output y € Y for 
each ordered pair (x,y) € R. When X and Y are subsets of R, make the graph 
of R by drawing a dot at each point in the plane with coordinates (x, y) € R. 


2. Images of Sets under Relations. For any set C, relation R, and object y, y € R[C] 
iff da € C, (x,y) € R. In an arrow diagram for R, R[C] is the set of all objects 
reachable by following arrows starting at points in C. In the graph of R, we find 
R[C] by drawing C on the z-axis, moving up and down from C into the graph, 
and finding the set of all y-coordinates reachable in this way. 


3. Properties of Images. For all relations R and all sets A and B, A C B implies 
R[A] C R[B]; R[AU B] = R[A] U R|B]; and R[Ø] = Ø. So the image construction 
preserves inclusions, unions, and the empty set. For intersections and differences, 
we have the weaker properties R[ANB] C R[A]NR[B] and R[A]—R[B] C R[A-B). 
Similar results hold for indexed unions and intersections. 


Exercises 


1. Let X = {1,2,3,4}, Y = {1,2,3,4,5}, and R = {(1,4), (1,1), (2,3), (2,4), (4,3)}. 
(a) Draw an arrow diagram for R. 
(b) Draw the graph of R. 
(c) Compute R[{1,2}], R[{3}], RH2,4}], and RX]. 
(d) Find all sets A,B C X with R[A] = R[B]. 


2. Let R be the relation with the following arrow diagram. 


(a) Describe R as a set of ordered pairs. 
(b) Find R{{1, 2, 3}], R[{2,4,5}], R[{3, 6}], and R{X]. 
(c) Find a relation S such that S[X] = Y and RNS =9. 
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3. 


10. 


11. 


12. 


Let X = {a,b,c,d,e}, Y = {v,w, x,y,z}, R = {(a,z), (b, y), (c, x), (d, y), (e, z)}, 
and S = { (a,v), (b, w), (c, x), (d, y), (e, v)} 

(a) Draw arrow diagrams for R, S, RA S, and RUS. 

(b) Find the image of {a, c,d} under each relation in (a). 

(c) Which subsets of Y have the form R[A] for some set A? 


Let R = {(x,y) E€ R x R : z? + y? = 25}. Draw the graph of R. Find R[{1}], 
Ri{—-3}], R[[6, 9]], R[(—5, 0)], and RIZ]. 

Let X = {1, 2,3,4,5}, and let R = { (x,y) E€ X x X : zy is even}. Draw an arrow 
diagram and graph for R. Compute R[{1,3}] and R[{2, 4}. 


Let R= {(x,y) E€ R: |zy| = 1}. 

(a) Draw the graph of R. 

(b) Find R{{2}], R[{O}], RG, 4]], and R[R <9). 
(c) Find RN (Z x Z). 
Define a relation R = {(x, y) : x € [—3,3] and (y = 2x or y = —x)}. 
(a) Draw the graph of R in the ry-plane. 
(b) Given A = [1,2] and B = (—3, —1], compute the sets R[A], RIB], R[A — B], 
R[A] — RIB], R[AN B], and R[A] A R|B]. 
Let R = {(x,y) E€ R x R : y = sinz} and S = {(x,y) E R x R : x = siny}. Com- 
pute the image of the following sets under R and S: (a) {0} (b) {r} (c) {—1/2} 
(d) [-1,1] (e) R. 

Let X = {1,2,3,4,5}, Y = {i j,k,m,n}, R = {(1,n), (1, i), (2, m), (4, i), (4, m), 
(5,3), (5, k), (5, m)f, and S = {(1, n), (2,7), (3, k), (3, m), (3, n), (4, m), (5, n)}. 

(a) Draw arrow diagrams for R, S, ROS, and R— S, viewed as relations from X 
to Y. (b) Let A = {2,3,5}. Compute the sets R[A], S[A], (RNS)[A], RIAJA SJA], 
(R- S)[A], and R[A] — S[A]. 

Let S be the relation from R to R with the graph shown here. 


Find S[{1,2,3,4}], SH—4,—1}], S[(—2,1]], S[[-1,2)], S[(2,3)], S[Reo], and 
S[R>o]. 

(a) Prove: for all relations R and S and all sets A, if R C S, then R[A] C S[A]. 
(b) Disprove: for all relations R and all sets A and B, if R[A] C R[B], then A C B. 
(c) Prove or disprove: for all relations R and S and all sets A, if R[A] C SJA], 
then RCS. 

(d) Prove or disprove: for all relations R and S, if R[A] C SA] for every set A, 
then RCS. 


Prove: for all sets A, O[A] = 0. 
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13. 


14. 
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16. 
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(a) Prove: for all relations R and S and all sets A, (RU S)[A] = R[A] U S[A]. 
(b) State and prove a generalization of (a) involving the image of an indexed 
union of relations. 


(a) Prove part (d) of the Theorem on Images. (b) For each set inclusion proved 
in (a), give a specific example where set equality does not hold. 


Prove part (e) of the Theorem on Images, and give a specific example where 
equality does not hold. 


For any sets A, B, C, find (with proof) the set (A x B)[C]. 


Suppose R is a relation having the following property: 

for all w, x,y, if (w,y) € R and (z, y) € R, then w = z. 

(a) What does this property say about the arrow diagram of R? 
(b) What does this property say about the graph of R? 

(c) Prove: for all sets A and B, R[AN B] = R[A]N RIB). 

(d) Prove: for all sets A and B, R[A] — R[B] = R[A — B]. 
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5.2 Inverses, Identity, and Composition of Relations 


This section introduces some concepts for relations — inverses, composition, and identity 
relations — that foreshadow and generalize the corresponding concepts for functions. One 
advantage of working with relations is that (unlike functions) every relation has an inverse 
relation. 


Inverse of a Relation 


Recall that a relation R from X to Y is a set of ordered pairs (x,y), where x is a member 
of the input set X and y is a member of the output set Y. The relation R associates zero 
or more outputs y € Y with each input x € X. We could reverse the action of R by 
interchanging the roles of inputs and outputs. This produces a new relation R~! from Y to 
X, called the inverse of R, which is found by replacing each (x,y) in R by (y, x). Formally, 
we have the following definition. 


5.15. Definition: Inverse of a Relation. For all relations R and all objects a and b: 


R`! is the relation such that | (a,b) € R7! iff | (b,a) € RI. 


This definition states that RT! is a relation, so the only possible elements of R~! are 
ordered pairs. We could be more explicit by saying: for all objects z, z € R7! iff Ja, 3b, z = 
(a,b) A (b,a) € R. A similar comment applies to other relations defined below. 


5.16. Example: Inverses via Arrow Diagrams. Suppose X = {1, 2,3, (4,5), (5,4)}, 
Y = {a,b,c,d,0}, and 


R= {(1,a), (1, c), (3,0), (4,5), a), (5; 4), €), (5, 4), d)}. 
By reversing all the ordered pairs, we see that 
R`! = {(a,1), (c, 1), (9, 3), (a, (4,5)), (c, (5, 4)), (d, (5, 4))}- 


The arrow diagrams for R and R~! are shown below. We get the arrow diagram for R7! 
by reversing all the arrows in the arrow diagram for R. 


Let us compute the images of some sets under R and Rt. First, 
R[{1, 2, 3}] = {a,¢, 0}, R-'[{a, c, 0} = {1, (4,5), (5,4), 3}, 
so R™'[R[{1, 2, 3}]] = {1, (4,5), (5, 4), 3} # {1,2, 3}. 
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R~*[{b, c} = {1, (5,4)}, RHL, (5, 4)}] = {a,c, d}, so RIR Hb, c}]] = {a, c, d} # {b,c}. 
Note R~'[0] = 0, but R-*[{O}] = {3}. 


5.17. Example: Inverses via Graphs. The left side of the figure below shows the graph 
of a relation S' from [—2,2] to [—2,2]. The right side of the figure shows the graph of the 
inverse relation 571. 


y S-71 hy 


i 


To see how the graph of the inverse is found, first consider the isolated point (2, —1) € S. 
Switching the coordinates, we see that (—1,2) is a point in S~!. Next, the part of the 
graph of S that looks like the letter N consists of line segments from (1/2,1/2) to (1/2, 2) 
to (3/2,1/2) to (3/2,2). Reversing the coordinates, we obtain three line segments from 
(1/2, 1/2) to (2,1/2) to (1/2, 3/2) to (2,3/2). The counterparts in S~! of the solid triangle 
and circular arc in S$ are computed similarly. Geometrically, the graph of S~! is obtained 
from the graph of S by reflecting the picture through the line y = x, since this reflection 
has the effect of replacing each (a,b) in the original graph by (b, a). 

Note that SHO, 1/2} = {—1/2}U[1/2, 2], whereas $~'[{0, 1/2}] = {1/2,3/2}. The latter 
image can be found either by looking at vertical lines through x = 0 and x = 1/2 in the 
graph of S~!, or by looking at horizontal lines through y = 0 and y = 1/2 in the graph 
of S. We have S~+[[0,2]] = [-2, —1] U [1/2,3/2] = S~1+[[1/2,00)]. Next, S[S~*[Reo]] = 
S|[-2, 0] U {2}] = [—3/2, —1/2] U [1, 3/2]. 


5.18. Example. We know 9) is a relation containing no ordered pairs. It follows that 
Ø! = Ø. Drawing a picture, you can check that the inverse of the relation R = [0,3] x [1, 2] 
is the relation R7! = [1,2] x [0,3]. More generally, for any sets X and Y, we claim that 
(XxY)~! = Y x X. Here is a quick chain proof: for any ordered pair (a, b), (a,b) € (XxY)~+ 
iff (ba) E X xY iffbe X anda€cY iffae Y and be X iff (a,b) EY x X. 


Identity Relation on a Set 


For any set X, we can create an especially simple relation from X to X called the identity 
relation on X and denoted Ix. This relation consists of all ordered pairs (x, x) with x € X. 
The name “identity” is used since each x € X is related to itself and nothing else. The 
formal definition follows. 


5.19. Definition: Identity Relation on a Set. For all sets X and all objects a and b: 


Ix is the relation such that | (a,b) € Ix | iff |a € X anda=b|. 
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5.20. Example: Arrow Diagram of an Identity Relation. 
Let X be the set {1, 2,3, (4,5), (5,4)}. The identity relation on X is 


Ix = {(1, 1), (2, 2), (3,3), (4,5); (4,5), (5, 4), (5, 4)) 


The arrow diagram for Ix is shown below. For each x € X, there is an arrow pointing from 
x to itself. We see that Tz = Ix. For all A C X, Ix[A] = A. These properties hold in 
general. 


5.21. Example: Graph of an Identity Relation. Let A = [—2,0) U(1,2] and B = 
{—2,-—1,0,1/2,1}. The next figure shows the graphs of the identity relations J4 and Ip. 
The graph J, consists of the portion of the line y = x consisting of points (c,c) with c € A; 
similarly for Ig. The entire line y = z is the graph of Ig. Note Ig = 0, so the graph of Ig is 
empty. 


Ty y Ig ty 


Composition of Relations 


Informally, given two real-valued functions f and g, we can form a new function go f (the 
composition of g and f) as follows. Starting with an input x € R, we feed this number into 
f to obtain an output y = f(x). Now we regard this output y as the input to g, obtaining 
a new output z = g(y). The net effect is that z = g(y) = g(f(x)) = (go f)(x). We can 
generalize this idea to relations, as follows. 


5.22. Definition: Composition of Relations. For all relations R and S and all ordered 
pairs (a,b), So R is the relation such that | (a,b) € So R) iff| dc, (a,c) E RA (c,b) € S|. 


The relation S o R is the composition of the relations S and R, in this order. 
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Note carefully the order of the relations in the expression S o R, which is motivated by 
the analogy with the function composition go f. Observe that when computing (g o f)(x), 
we start with the input x, apply the function f, and then apply the function g. Analogously, 
given an ordered pair (a,b) € S o R, we know there exists c with (a,c) € R and (c,b) € S. 
This means that the original input a is related (under R) to the intermediate output c, and 
then c (viewed as an input to S$) is related to the final output b. The net effect is that input 
a is related to output b under So R. 


5.23. Example: Composition of Relations via Arrow Diagrams. Let X = {1,2,3,4} 
and Y = {a,b,c,d,e}. Define a relation R from X to Y and a relation S from Y to X by 


R= {(1,d), (3, d), (4, a), (4, e)}, S= {(a, 2), (b, 1), (c, 3), (č; 4), (d, 2), (e, 1)}. 


The arrow diagrams for R and S are shown below. 


We can compute So R, which is a relation from X to X, by starting at points in X and seeing 
where we can go by traveling along an R-arrow followed by an S-arrow. By examination of 
the figure, we arrive at this arrow diagram for So R: 


SoR X 


2o 


= 


24 


Symbolically, So R = {(1, 2), (3, 2), (4,2), (4,1)}. For example, (4,1) € S o R because 
(4,e) € Rand (e,1) € S. Similarly, we can compute Ro S, which is a relation from Y to Y, 
by combining the arrow diagrams for R and S in the other order, as shown here: 
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The arrow diagram for Ro S looks like this: 


Y Ros Y 


° 
a 
e 
b 
° 
c 
e 
d 


oe 


So Ro S = {(b, d), (c,a), (c, d), (c,e), (e,d)}. Note carefully that | So R# Ro S| in this ex- 
ample (and in most examples). This means that composition of relations is not commutative. 
Indeed, in our example, (Ro S) N (S o R) = 0, which says that these two sets of ordered 
pairs have no members in common. 


5.24, Example. Let X = {1,2,3}, R = {(1, 2), (2,3), (3,1)}, and S = {(1, 3), (2, 1), (3, 2)}. 
You can check that Ro S = {(1,1), (2,2), (3,3)} = So R = Ix. Thus, it does sometimes 
happen that Ro S = So R. In this example, it is also true that Rt = S = Ro R and 
SS ha Sos. 


5.25. Example. Let R = [0,2] x [1,3] and S = Ir U {(xz,2— x) : x € [0,2]}. Trying to read 
off the compositions S o R and Ro S from the graphs of R and S can be tricky, so let us 
return to the definition. Which ordered pairs (a,b) belong to So R? First, the original input 
a must belong to [0, 2], and all such inputs are related to all numbers c € [1,3] (and nothing 
else). Proceeding to S, a given input c € [1,3] is always related (under S) to c; furthermore, 
each number c in the range [0,2] is also related to 2 — c. Thus, the final output b can be 
any number in [1,3], and (by considering c € [1,2]) b can also be any number in [0,1]. In 
summary, (a,b) € So R iff a € [0,2] and b € [0,3], so that So R = [0,2] x [0,3]. 

On the other hand, when is (a,b) € Ro S? Now we need to find c with (a,c) € S and 
(c,b) € R. One possibility is that (a,c) € Ig, which means a € R and c = a. Then (c, 0) is 
in R iff c = a is in [0,2] and b is in [1,3]. So every element of [0,2] x [1,3] is in Ro S. The 
other possibility for (a,c) € S is that a € [0,2] and c= 2 — a € (0, 2]. In this case, (c,b) € R 
holds iff b € [1,3]. We conclude Ro § = [0,2] x [1,3] =RASOR. 


Section Summary 


1. Inverse of a Relation. For any relation R, the inverse of R is the relation R7! 
such that for all a and b, (a,b) € R7! iff (b,a) € R. To go from the arrow diagram 
of R to the arrow diagram of R~!, reverse all the arrows. To go from the graph 
of R to the graph of R~}, reflect the xy-plane through the line y = x. 


2. Identity Relation. For any set X, the identity relation on X is the relation Ix 
such that for all a and b, (a,b) € Ix iff a € X and a = b. The arrow diagram for 
Ix has an arrow from « to x for each x € X. The graph of Ix is the part of the 
line y = x consisting of those points (c,c) with c € X. 

3. Composition of Relations. For any relations R and S, the composition of R fol- 
lowed by S is the relation S o R such that for all a and b, (a,b) € So R iff 

de, (a,c) E RA (c,b) € S. The arrow diagram for So R is found by drawing an 

arrow from a to b whenever there is a path from a to b consisting of an arrow for 
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R followed by an arrow for S. In most cases, So R 4 Ro S. So composition of 
relations is not commutative. 


Exercises 


1. Let X = {1,2,3,4,5,6}. Define a relation R from X to X as follows: 
R= {(, 2), (1, 4), (1, 6), (2, 4), (4, 5), (4, 6), (6, 3)}- 


a) Draw an arrow diagram for R and the graph of R in R?. 

b) Describe RT! as a set of ordered pairs, as an arrow diagram, and as a graph. 
c) Describe Ro R as a set of ordered pairs, as an arrow diagram, and as a graph. 
d) Compute the following images of sets under the relations R and R7!: 


( 
( 
( 
( 
R({2,3,4}],  R{3,5}], Ri{1}], R-*[{2,3,4}], Ro*[{3, 5}, 
ROKI] RR“ [{1,2,3}]], R-“[R[{1, 2, 3}]]. 
2. Define X and R as in the previous problem. Define 


S= {(2, 1), (2, 2), (2,3), (2, 4), (1, 5), (1,6), (3, 3), (4, 6)}. 

Draw arrow diagrams for each relation: 

(a) RoS (b) SoR (c) (RoS)! (d) Roto S7! (e) S-to R! (f) (RoS)oR. 
3. Let X = {1,2,3,4}, R= {(a,b) EX x X:a<b} and 

S={(a,b)e Xx X:a< d}. 

(a) Draw arrow diagrams for R and S. 

(b) Compute R71, 57t, Ro S, So R, RoR, SoS, RUS, and RNS. 
4. Define a relation T from R to R by the following graph. 


(a) Carefully sketch the graph of T7?. 

(b) What is the set T N Iz? 

(c) Compute T[{2}], T-}[{2}], Tl[6, 7], and T-2{[0, 1)]. 
5. Let R be the relation in Example 5.16. 

(a) Find all subsets A of X such that R~1[R[A]] = 
(b) Find all subsets B of Y such that R[R~1[{B]] = 
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6. Let S be the relation in Example 5.17. (a) Find S[{—2,—1}]. (b) Find 
S~*[{-2, -1}]. (c) Find S{[1, 2]]. (d) Find $~"[[1,2)]. (e) Find S~![S[R <o]]. 

7. (a) Draw an arrow diagram and a graph for I¢_1,9,2}. (b) Draw the graph of Ip_z. 
(c) Let X = {0,1} x {1,2,3}. Draw an arrow diagram for Ly. 

8. Let R= {(z,x7):2 € R}, S = {(x, 32): x € R}, and T = {(z,£ — 1): x € R}. 
Draw graphs of the following relations: (a) Ro S (b) So R (c) RoT (dA) To R 
(e) SoT (£) T oR (g) Ro R- (h) R o R. 

9. Let R = {(2,y) € R? : z? +y? = 4} and S = {(z,y) € R? : y? = 27}. True 
or false? Explain each answer. (a) R! = R. (b) St = S. (c) RoR = S. 
(d) So S = S. (e) Ro S = R. (f) So R=R. 

10. Repeat the previous exercise taking R = {(z,y) € R? : x? +y = 4} and S = 
{(@,y) € R? : y = —a}. 

11. Let A, B, C, and D be arbitrary sets. Describe the following relations as explicitly 
as possible: (a) (Ax B)o(C'x D) (b) (Ax B)™t (c) Ian Ip (d) I4AUIpg (e) (Ax B)ol 
(£) (B x A) o R, where R is a relation from A to B. 

12. Let X = {1,2,3}. Give four examples of relations R from X to X satisfying 
Ro R= R = R™!. In one of your examples, find a relation R consisting of five 
ordered pairs. 

13. Define X = {1,2,3,4}, Y = {a,b,c}, and R = {(1,b), (4, a), (3,c)}. 

(a) Find a relation S from Y to X such that So R = Ix, or explain why no such 
S exists. (b) Find a relation S from Y to X such that Ro S = Iy, or explain why 
no such § exists. 

14. Repeat Problem 13 taking X = R, Y = Ryo, and R = {(z,y) € R? : y = 2°}. 

15. Repeat Problem 13 taking X = Y = [-1,1] and R = { (z, y) E€ R? : £? +y? =1}. 


16. Define a relation R by the following arrow diagram. 


(a) Compute R}, Ro R}, and R! o R. 
(b) Find three relations S from Y to X such that So R = Iq 4}. 
(c) Find all subsets B C Y for which there exists a relation S from Y to X with 
Ro S= Íp. 

17. Let R be a relation from X to Y. 
(a) Find a condition on R that guarantees the existence of a relation S from Y 
to X with SoR=X xX. 
(b) Find a condition on R that guarantees the existence of a relation T from Y 
to X with RoT =Y xY. 
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5.3 Properties of Relations 


In this section, we state and prove many properties of inverses, identity relations, and 
compositions. Before starting, you are encouraged to review the relevant proof templates 
from set theory and the definitions from the last two sections. 


Closure Theorem for Relations 


We begin with a preliminary theorem indicating what kind of relations are produced by 
various constructions. 


5.26. Closure Theorem for Relations. For all sets X, Y, Z, I 49 and all relations R, 
S, Ri (for i € I): 


(a) If R and S are relations from X to Y, then RUS and RNS and R — S are relations 
from X to Y. 


(b) If R; is a relation from X to Y for each i € J, then U 
from X to Y. 


(c) R is a relation from X to Y iff R7! is a relation from Y to X. 
(d) Ix is a relation from X to X. 


(e) If R is a relation from X to Y and S is a relation from Y to Z, then So R is a relation 
from X to Z. 


R; and (),-, Ri are relations 


icl tel 


We prove (a) and (e) as illustrations. Fix arbitrary sets X, Y, and Z and arbitrary 
relations R and S. For (a), assume R and S are relations from X to Y, which means 
RCXxYandSCX xY. We must show RUS and RNS and R-—S are relations from 
X to Y. Part 1. Prove RUS C X x Y. Since we assumed RC X x Y and SC XxY, 
this follows from the least upper bound property (part (f) of the Theorem on Subsets; see 
page 106). Part 2. Prove RN S C X x Y. We know RN S C R by part (c) of the Theorem 
on Subsets, and we assumed R C X x Y, so RNS C X xY follows by transitivity of C 
(part (b) of the Theorem on Subsets). Part 3. Prove R— S C X xY. As in part 2, we know 
R-SCRandRCXxY,soR-—SCX xY follows. 

For (e), assume R is a relation from X to Y and S is a relation from Y to Z. Prove So R 
is a relation from X to Z. We have assumed R C X x Y and S C Y x Z. We must prove 
So RC X x Z. To prove this, fix an arbitrary ordered pair (a,b), assume (a,b) € So R, and 
prove (a,b) € X x Z. We have assumed dc, (a,c) € RA (c,b) € S. We must prove a € X 
and b € Z. First, since (a,c) € Rand RC X xY, we see that (a,c) E€ X xY. Soa E€ X and 
cE Y, hence a € X. Second, since (c,b) € S and S CY x Z, we see that (c, b) € Y x Z. So 
cE Y and be Z, hence DE Z. 


Properties of Inverses, Identity Relations, and Composition 


Our next theorem lists some general properties of inverse relations, identity relations, and 
composition of relations. 


5.27. Theorem on Relations. For all sets X, Y, C and all relations R, S, T: 
(a) Inverse of Identity: Ix' = Ix. 

(b) Double Inverse: (R~+)~1 = R. 

(c) Inverse of Union: (S U T)! = StU Tt = Tt U S7. 
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(d) Inverse of Intersection: (S A T)! = StA T7! = Tt A 871. 


(e) Composition with Identity: If R is a relation from X to Y, then Ro Ix = R and 
Iy oR=R. 


(£) Associativity of Composition: T o (S o R) = (T o S) o R. 
(g) Inverse of a Composition: (Ro S)7} = S7} o RH. 
(h) Image of a Composition: (S o R)[C] = S[R[C]]. 


(i) Monotonicity Properties: If X C Y, then Ix C Iy. If R C S, then R7! C S~t! and 
ToRCToSand RoTCSoT. 


(j) Empty Set Properties: Ig = Ý and 07t = Ø and RoW = ģ and 0o R =b. 
(k) Distributive Laws for U and o: 


(SUT)o R= (So R)U(T o R) andTo(SUR)=(ToS)U(ToR). 
(1) Partial Distributive Laws for N and o: 
(SAT)oRC(SoR)A(ToR) and To(SNR)C(ToS)N(ToR). 


Equality does not always hold. 


Items involving U and N generalize to indexed unions and intersections. As noted earlier, 
composition of relations is not commutative: in most cases, Ro S £ So R. 


We prove some representative parts of the theorem, leaving other parts to the exer- 
cises. You are strongly encouraged to work out proofs of all parts of this theorem now, 
before reading further. [Hints: When proving subset statements or set equality statements 
involving relations, you should fix arbitrary ordered pairs (a,b) rather than fixing arbitrary 
objects. Chain proofs may work well for sufficiently simple set equalities, but two-part set 
equality proofs are preferred for more complex statements. When proving a set is empty, 
contradiction proofs are often the fastest way.] 

Have you proved the theorem yet? To see our proofs, read on. Fix arbitrary sets X, Y, 
C and relations R, S, and T. 


5.28. Proof that (R~')~! = R. We give a chain proof. Let (a,b) be an arbitrary ordered 
pair. Using the definition of inverse relation twice, we see that (a,b) € (R71)! iff (b,a) € 
R! iff (a,b) € R. So (R71)! = R since these two relations contain exactly the same 
ordered pairs. 


5.29. Proof of Composition with Identity. Assume R is a relation from X to Y, which 
means R C X x Y. We prove Ro Ix = R (letting you prove Iy o R= R). 

Part 1. Prove Ro Iy C R. Fix (a,b); assume (a,b) € Ro Ix; prove (a,b) € R. By definition 
of composition, we know there exists c with (a,c) € Ix and (c,b) € R. Next, by definition 
of Ix, we know a € X and a = c. Since (c,b) € R, we deduce (a,b) € R, as needed. 

Part 2. Prove R C Ro Ixy. Fix (s,t); assume (s,t) € R; prove (s,t) € Roly. We must prove 
dw, (s,w) € Ix A (w,t) € R. Choose w = s. On one hand, since (s,t)€ Rand RC XxY, 
we see that (s,t) € X x Y. So s € X and t € Y, which shows that w = s is a member of X. 
Then (s, w) = (s,s) € Ix. Next, (w, t) = (s,t) € R by assumption. 


5.30. Proof of Distributive Law for U and o. We give a chain proof of (SU T) o R= 
(So R) U (T o R), leaving the proof of the companion identity to you. Fix an arbitrary 
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ordered pair (a,b). We know 


(a,b) E€ (SUT)OR & Ac, (a,c) E RA (c,b) E€ SUT 
S de, (a,c) E RA [(c,b) € SV (6,6) € T] 
s Ac, ([(a,c) E€ RA (c,b) € S] V [(a,c) € RA (c,b) € TJ) 
& (dc, [(a,c) € RA (c,b) € S]) v (Ac, [(a, 0) € RA (c, 6) € T]) 
=(a,b)E SoRV(a,b)EToOR 
= (a,b) E€ (So R)U (To R). 


The steps follow by definition of composition, definition of union, the distributive law for ^ 
and V, the distributive law for 4 and V, the definition of composition, and the definition of 
union. 


5.31. Proof of Empty Set Properties. We prove that Ø! = Ø. Assume, to get a 
contradiction, that @~! # Ø. Then there exists an object z € Ø7!; in fact, there exists an 
ordered pair (a,b) € 0~'. By definition of inverse, (b,a) € Ø. But we also know (b,a) ¢ 0, 
which gives us a contradiction. 

Next we prove that Rof = Ø. Assume, to get a contradiction, that Ro) Æ Ø. Then there 
exists an ordered pair (a,b) € R o Ø. By definition of composition, we know there exists c 


with (a,c) € Ø and (c,b) € R. But (a,c) ¢ 0, so we have found a contradiction. 


5.32. Proof of Partial Distributive Law. We prove To (SMR) C (ToS) (To R). 
Fix (a,b); assume (a,b) € T o (S N R); prove (a,b) € (ToS) (T o R). Our assumption 
means there exists w with (a, w) € SN R and (w,b) € T; so there exists w with (a, w) € S 
and (a,w) € R and (w,b) € T. We must prove (a,b) € ToS and (a,b) € To R. First, to 
prove (a,b) € T o S, we must prove Jx, (a,x) € S A (x,b) € T. Choose x = w; note that 
(a,w) € S and (w,b) € T have been assumed. Second, to prove (a,b) € T o R, we must 
prove Jy, (a,y) € RA (y,b) € T. Choose y = w; note that (a, w) € R and (w,b) € T have 
been assumed. 

We give an example to show that set equality may not hold. Let R = {(1,2)}, S = 
{(1,3)}, and T = {(2,4), (3,4)}. By drawing arrow diagrams or looking at the ordered 
pairs, we find that To S = {(1,4)}, To R = {(1,4)}, and so (T o S)N (To R) = {(1,4)}. 
But SN R=0, so To(SN R) =. Thus To (SN R) is a proper subset of (T o S)N(To R) 
in this example. 


5.33. Partial Proof of Associativity of Composition. We prove To(SoR) C (ToS)oR; 
we let you prove (To S)oR C To(SoR). Fix (a,b); assume (a,b) € To(SoR); prove (a,b) € 
(T o S)o R. We have assumed there exists w with (a,w) € So R and (w,b) € T. Expanding 
this again, we have assumed there exists w such that there exists x with (a,x) € R and 
(x,w) € S and (w,b) € T. Now, since (x, w) € S and (w,b) € T, we see that (x,b) ET o 5. 
Since (a,x) € R and (z,b) € T o S, it follows that (a,b) € (T o S) o R, as needed. 


5.34. Proof of (Ro S)~! = S-t o R7!. We try a chain proof. Fix an ordered pair (a,b). 
We know 


(a,b) € (Ro S) 1S (b,a) E€ RoS (by definition of inverse) 
© dc, (b,c) € SA(ca)ER (by definition of composition) 
© de (ca) E€ RA (b,c) ES (by commutativity of AND) 
S Jc, (a,c) E€ R7! A (c, b) € S7} (by definition of inverse) 
S (a,b) € S7} o R7! (by definition of composition). 
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Section Summary 


I; 


Closure Properties of Relations. The union, intersection, and set difference of 
relations from X to Y are also relations from X to Y. The inverse of a relation 
from X to Y is a relation from Y to X. The identity relation Ix is a relation 
from X to X. If R is a relation from X to Y and S is a relation from Y to Z, 
then So R is a relation from X to Z. 


Properties of Inverse Relations. For all sets X and relations R and S, Iz = Íx, 
(R1)! = R, (RU S) = RU S7}, (RA S)! = R! N S~t, ea), = 
S-to R71, RCS implies R7! C S7}, and 07t = Q. 


Properties of Composition of Relations. For all relations R from X to Y, 

Rolx = R = Iyo R. Composition of relations is associative but not commutative: 
T o (S o R) = (T o S) o R always holds, but S o R = RoS does not always 
hold. We have distributive laws such as (S U T) o R = (S o R) U (T o R) and 
(SNOT)oRC(SoR)N(ToR). 


Exercises 


= 


CONAA AR WD 


=. = 
= oOo 


12. 
13. 


14. 


15. 


16. 


17. 


18. 
19. 


Prove parts (b), (c), and (d) of the Closure Theorem for Relations. 
Prove part (a) of the Theorem on Relations. 

Prove parts (c) and (d) of the Theorem on Relations. 

Finish the proof of part (e) of the Theorem on Relations. 

Finish the proof of part (f) of the Theorem on Relations. 

Prove part (h) of the Theorem on Relations. 

Prove part (i) of the Theorem on Relations. 

Finish the proof of part (j) of the Theorem on Relations. 

Finish the proof of part (k) of the Theorem on Relations. 

Finish the proof of part (1) of the Theorem on Relations. 


Let A and B be arbitrary sets. 
(a) Prove Iaug = I4 U Ip. (b) Prove Iang = I4 N Ig = I4 0 Íp. 
Prove: for all relations R and S, Ro S = S o R iff Rt o 9571 = S71 o RE. 


(a) Prove: for all sets X and all relations R, S,T on X, if Rt o R = Ix and 
Ro S= RoT, then S = T. (b) Show that the result in (a) need not hold if we 
drop the hypothesis R7! o R = Ix. 

Prove: for all relations R, if RC Rt, then R = Rt. 


(a) Prove: for all sets X and all relations R on X, if Ix C R, then RC RoR. 
(b) Prove or disprove the converse of part (a). 


Prove or disprove: for all sets X and all relations R on X, if R = R71, then 
Ro R= Íx. 


Prove or disprove: for all sets X and all relations R on X, if Ro R = Ix, then 
R=R Ë. 


Prove: for all relations R, S, and T, (So R) AT =O iff (ToR')nNS=0. 


Prove: for all relations R, S, and T, TA (So R) C To(R-to R). Give an example 
where equality does not hold. 
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20. Prove: for all relations R, S, and T, RA SAT C (T o 87t) o R. Give an example 
where equality does not hold. 


21. Prove: for all relations R, S, and T, if ToS C S and T~!oS C S, then SN(ToR) = 
To(SNAR). 


22. Prove: for all relations R, S, T, U, (So R)AN(UoT) C [Uo((U~!0S)n(ToR!))JoR. 
Does equality always hold? 
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5.4 Definition of Functions 


One of the most fundamental and pervasive concepts in mathematics is the notion of a 
function. In calculus, we study real-valued functions of a real variable such as the exponential 
function, the natural logarithm function, polynomial functions, and trigonometric functions. 
In advanced calculus and other parts of mathematics, we need to consider more general kinds 
of functions. In this section, we give a formal definition of functions within the framework 
of set theory. We illustrate the logical nuances of this definition with many examples of 
functions and non-functions, presented using arrow diagrams, graphs, and formulas. 


Arrow Diagrams of Functions and Non-Functions 


We begin with an informal discussion of functions, which motivates the formal definition 
below. Informally, a function g from a set A into a set B is a rule that assigns to each 
object x in A a unique object g(x) in B. The set A is called the domain of g; the set B 
is called the codomain of g; and g(x) is called the value of g at the input x. The crucial 
point is that each input x in A has exactly one output g(x) associated to it, and this output 
is required to belong to the codomain B. We use the notation |g: A — B | to mean that 


g is a function with domain A and codomain B. 


8 8 83 

2 

A Bs Be so 

9 SO & 
A B A B A B 
FIGURE 5.1 


Arrow diagrams illustrating functions and non-functions. 


These requirements can be conveniently visualized using arrow diagrams. Consider the 
six arrow diagrams in Figure 5.1. The three arrow diagrams in the top row all represent 
valid functions from the set A = {v, w, x, y} to the set B = {1, 2,3, 4}. In each case, there is 
exactly one arrow emanating from each point in the domain A, and each such arrow leads 
into the codomain B. We have, for example, gi(w) = 4, g2(y) = 2 = go(v), and g3(a) = 4 
for alla € A. Note that it is permissible for several arrows to hit the same element of B. For 
example, in the diagram for g3, four different arrows hit the same point 4 € B. Similarly, 
there may exist points in B that are not hit by any arrows. For example, in the diagram 
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for g2, no arrow leads to 1 € B, and yet gə is still considered to be a function mapping A 
into B. 

On the other hand, g4 is not a function from A to B because there is no arrow leaving 
the input point w € A. The arrow diagram gs is not a function because there are multiple 
arrows leaving v (and also y). Finally, gg is not a function from A to B because g(x) = 0, 
which is not an element of the codomain B. However, gg can be regarded as a function from 
A to the set {0, 1,2,3,4}. 


Graphs of Functions and Non-Functions 


We continue to build intuition for the function concept by looking at the graphs of some 
functions and non-functions with domain R and codomain R. Informally, given a function 
f : R —> R, the graph of f is the set of ordered pairs Gy = {(x, f(x)) : x € R}, which is 
a certain relation from R to R. The graph Gy encodes exactly the information needed to 
understand how the function f transforms inputs to outputs. Specifically, for each input 
x € R, there must exist a unique y € R such that (x,y) belongs to the graph Gs, and we 
then know that y = f(a) is the output associated with the input x by the function f. 


G, G, G; 
y, Yh y 
e %0,2) 
x x x 
G, G; Ge 
y. Yh y 


0,2) a 
oi (1) 
x x x 
NY Pa- 


FIGURE 5.2 
Graphs of functions and non-functions. 


For example, consider the graphs shown in Figure 5.2. Each graph illustrates a relation 
on R x R consisting of certain ordered pairs (x,y). The first three pictures are graphs of 
functions from R to R. Specifically, Gi is the graph of the function gı : R —> R defined 
by gı(x) = —|zx| for each real x. G2 is the graph of the function gg : R + R defined by 
g(x) = (x?/4) +1 for each real x. G3 is the graph of the constant function g3 : R > R 
defined by g3(x) = 2 for all real x. In each case, every particular real input x has exactly one 
associated output y. Some of the potential outputs y € R do not appear as actual outputs 
of the functions, but this does not matter. Similarly, some outputs have more than one 
associated input, for instance 2 = g3(0) = g3(1) = g3(3), but this does not matter either. 
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On the other hand, G4 = {(z,y) € R x R : 2? +y? = 4} is not the graph of any function 
from R to R. One reason is that some inputs x in the domain R do not have any associated 
output y in the codomain. For example, there is no y € R such that (3, y) € G4. We could 
try to circumvent this difficulty by shrinking the domain of the function from all of R to the 
closed interval [—2, 2]. But we still do not get a function, because there are some inputs that 
have more than one associated output based on the graph. For example, (0,2) € G4 and 
(0, —2) € G4. In terms of formulas, the condition defining the relation G4 can be rewritten 

= +/4- 2? for —2 < x < 2. This is a double-valued expression, but we insist that all 
functions be single-valued. One possible way to extract a function from this example would 
be to define g4 : [—2,2] > R by ga(a) = —V4— x? for —2 < x < 2. The graph of this 
function would be just the lower half of the circle of radius 2 centered at the origin. 

Next, Gs is not the graph of a function with domain R, since each negative real input x 
has no associated output using this graph. If we shrink the domain to Ryo, we would obtain 
a function gs : R>ọ —> R given by g5(x) = yz for all x > 0. Here we could also shrink 
the codomain (the set of allowable outputs) to get a function gf : R>o > Rso given by 
gi (x) = yz for all x > 0. Are gs and gz the same function? Based on the formal definition 
below, the answer turns out to be no, because gs and gs have different codomains. Here, gs 
and gf are different functions with the same graph and the same domain. 

Finally, Gg is not the graph of a function because of the ordered pairs (1,1) € Ge and 
(1,—1) € Ge. If exactly one of these ordered pairs were removed from Ge, we would get 
the graph of a function gg : R —> R. We could make additional functions with the same 
domain and graph by changing the codomain to be any set containing the image Gg[R] of 
the relation Gg. 

Recall the key condition in the informal definition of a function: g is a function from 
the domain A C R to the codomain R iff for each x € A there exists exactly one y with 
y = g(x), and moreover y must belong to R. A given relation G C R? will be the graph 
of such a function iff the following geometric property holds: for every xp € A, the vertical 
line x = 2o intersects G in exactly one point; and for every xo € R — A, the vertical line 
x = Xo does not intersect G at all. In beginning calculus, this is often called the vertical line 
test for determining whether a given graph is a function. One point not often stressed in 
elementary treatments is that we must know the intended domain A in advance in order to 
see if a graph passes this test. Often A is tacitly assumed to be the largest possible domain 
that might work for the given graph, namely A = {x € R: dy E R, (x,y) € G}. 


Formal Definition of a Function 


We are now ready to present the formal definition of a function. This definition needs to 
capture all the information contained in the arrow diagrams and graphs we have been dis- 
cussing: the input set A, the output set B, the rule by which inputs x € A are associated 
with outputs y € B, and the key requirement that each input have exactly one associated 
output. The crucial insight needed to formalize these ideas is that the vague idea of a “rule” 
for transforming inputs to outputs can be captured precisely by forming the relation consist- 
ing of all ordered pairs (x,y) such that input x is mapped to output y by the function. By 
analogy with the case where A = B = R, this relation is called the graph of the function. For 
instance, the function gı shown in Figure 5.1 has graph G = {(v, 2), (w, 4), (x, 1), (y, 3)}. 
These considerations motivate the following formal definition. 


5.35. Definition: Functions. A function g is an ordered triple g = (A, B, G) such that: 
A and B are sets, G is a relation from A to B, and Vz € A, 3!y, (x,y) E€ G. 

A is called the domain of g; B is called the codomain of g; and G is called the graph of g. 
The notation | g : A > B | means that | g is a function with domain A and codomain B. 


234 An Introduction to Mathematical Proofs 


For all objects x,y, we write | y = g(x) iff (x,y) € G|. 


So we can describe the graph of g as | G = { (z, g(x)): x € A} |. 


We must emphasize at the outset that the formal ordered triple notation g = (A, B, G) 
is almost never used; instead, we write g : A > B to introduce a new function with domain 
A and codomain B. Similarly, only in rare cases do we need to mention the graph by 
writing (x,y) € G; instead, we use the equivalent function notation y = g(x). If we say “let 
g : A —> B be defined by g(x) = (expr),” where (expr) is some expression involving 2, it is 
understood that the graph of g is the relation G consisting of all ordered pairs (x,y) where 
x € A and y is the value of the expression for this choice of x. 

Let us spell out the three conditions that the relation G must satisfy in order to be the 
graph of a function g : A > B. 


(a) Valid Inputs and Outputs: Every member of G is an ordered pair (x, y) with z € A and 
yEB. 


(b) Existence of Outputs: For all x € A, there exists y with (x,y) € G. 
(c) Uniqueness of Outputs: For all x,y,z, if (x,y) € G and (a, z) € G, then y = z. 


Here are the same three conditions for g : A —> B to be a function, written in function 
notation. 


(a) Valid Inputs and Outputs: For all x, y, if y = g(x), then z € A and y € B. 

(b) Existence of Outputs: For all x € A, there exists y such that y = g(x). 

(c) Uniqueness of Outputs: For all x,y,z, if y = g(x) and z = g(x), then y = z. 

The uniqueness condition can be stated in a way that does not explicitly mention y and z. 
(c') Uniqueness of Outputs: For all z1, x2 € A, if xı = x2, then g(x1) = g(£2). 


The main technical drawback of the function notation g(x) is that the very use of the 
notation suggests that the existence and uniqueness of the output g(x) associated with 
input x are already known. But in fact, when introducing a new function for the first time, 
we really need to prove these assertions before the function notation can be legitimately 
used. Some of the dangers of using this notation prematurely are illustrated in the following 
subsection. 


5.36. Example. Consider the arrow diagram labeled gə in Figure 5.1. We could formally 
define the function gg by setting 


g2 = ({v, w, z, y}, {1,2,3,4}, {(v, 2), (w, 3), (x, 4), (y, 2)}). 


But we would usually introduce this function by saying “define go : {v, w, x, y} > {1, 2,3, 4} 
by setting g(v) = 2, g(w) = 3, g(x) = 4, and g(y) = 2.” 


5.37. Example. Consider the graph G2 in Figure 5.2. One possible function having this 
graph could be formally defined as 


g = (R, R>0, G2) = (R, Rso, { (x, 27/4 +1): x E€ R}). 


But we would usually introduce this function by saying “define g : R — R>o by setting 
g(x) = x? /4+ 1 for all z € R.” 


Formulas for Functions and Non-Functions 


Here are more examples of function notation using some familiar functions from calculus. We 
defer to the next section the task of proving that these formulas really do define functions. 
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5.38. Example. Define the cube function C : R —> R by C(x) = x? for x € R. Define 
the absolute value function A : R —> R by A(x) = x for real x > 0, and A(x) = —2 for 
x < 0; we often write A(x) = |x|. Define the sine function S : R > R by S(x) = sing 
for all real x. If we shrink the domain and codomain of S, we obtain a related function 
Sı : [-—71/2,2/2] > [-1,1] given by S\(x) = sin x for —7/2 < x < 1/2. This function is not 
the same as the function S. Finally, define the natural logarithm function L : (0,00) > R 
by L(x) = Ing for all real x > 0. We obtain a related function with a larger domain by 
defining Lı : Ryo > R by Ly(x) = In|z| for all real x # 0. Note Li(x) = L(A(a)) for all 
x #0. 


The next examples illustrate the pitfalls that can occur when we use function notation 
before checking the three defining properties above. 


5.39. Example. Suppose we try to define f : R > R by f(x) = 1/z for all real x. This 
definition is incorrect, because the input x = 0 has no associated output in R. We could 
remedy this by defining f(x) = 1/x for all real nonzero x, and setting f(0) = 0 (say). 
Alternatively, we could shrink the domain, defining fo : Ro > R by fo(x) = 1/2 for all 
real x Æ 0. 


5.40. Example. Suppose we try to define g : Z > Z by g(m) = m/2 for all m € Z. The 
graph of g is {(m,m/2) : m € Z}, but this is not a subset of Z x Z. In other words, the 
proposed function does not map the given domain Z into the given codomain Z. We could 
fix this by enlarging the codomain to Q (say), defining g : Z > Q by g(m) = m/2 for all 
me Z. 


5.41. Example. Suppose we try to define h : Q > Z by h(m/n) = m for all rational 
numbers m/n. More precisely, the graph of h is H = {(m/n,m) : m € Z,n € Zyo}. In 
fact, h is not a function because its graph fails the Uniqueness of Outputs condition. For 
instance, choosing x = 1/2 = 2/4, we find that (1/2,1) € H and (2/4,2) = (1/2,2) € H, 
yet 1 # 2. Rephrasing this example using the (technically forbidden) function notation, we 
have 1/2 = 2/4 and yet h(1/2) = 1 4 2 = h(2/4). We often say that “h is not a well-defined 
function” to indicate that the relation H assigns multiple outputs to a given input. In this 
situation, it is clearer and safer to discuss the graph H instead of using the function notation 
h(m/n) for the non-function h. 


5.42. Example. Finally, let us try to define f : Q>o > Qso by f(m/n) = n/m for all 
positive rationals m/n. More precisely, we are setting f = (Qs0, Qso, F), where the graph 
F is the set {(m/n, n/m) : m,n € Zo}. We might at first think that f is not single-valued, 
as in the previous example. But in fact, the f considered here is a legitimate, well-defined 
function! We verify Uniqueness of Outputs as follows. Suppose x,y,z satisfy (x,y) € F 
and (a,z) € F; we must prove y = z. By definition of F, there exist m,n,p,q € Zso with 
(x,y) = (m/n, n/m) and (x, z) = (p/¢,q/p). Comparing first coordinates, m/n = x = p/q, 
so mq = pn. Using this and comparing second coordinates, we see that y = n/m = q/p = z, 
as needed. In function notation, we have just shown: for all m/n and p/q in Qso, if m/n = 
p/q then f(m/n) = f(p/q), i.e., n/m = q/p. (To finish the proof that f is a function, we 
must also check Existence of Outputs and that F C Qso x Qso.) 


5.43. Remark. Some texts adopt a different definition of functions that essentially iden- 
tifies a function g = (A, B,G) with its graph G. On one hand, the domain A of g can be 
reconstructed from G, since it is the set of all first coordinates of the ordered pairs in G. On 
the other hand, the codomain B of g cannot be inferred from G alone. For example, for the 
three functions shown in the top row of Figure 5.1, knowing where all the arrows go does 
not give us enough information to deduce what the codomain B is. Similarly, we cannot 
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determine the codomain of a real-valued function from inspection of its graph, although we 
often assume an intended codomain of R. In many areas of mathematics (including abstract 
algebra, topology, and differential geometry), it is crucial to distinguish functions with the 
same domain and the same graph but different codomains. Some of the reasons for this ap- 
pear below when we discuss surjections, bijections, and invertible functions. Thus we must 
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use the more elaborate definition of functions that explicitly includes the codomain. 


Section Summary 


il 


Definition of a Function. A function is an ordered triple g = (A, B,G) where A 
and B are sets, G C A x B, and Va € A, 3!y, (x,y) E€ G. A is the domain of g, 
B is the codomain of g, and G is the graph of g. The notation g : A + B means 
g is a function with domain A and codomain B. For all x and y, y = g(a) means 
(x,y) € G. We have G = {(x,g(x)): x € A}. 


Arrow Diagrams of Functions. In an arrow diagram for a function g : A > B, 
every input x € A has exactly one arrow leaving it, and every arrow arrives at 
some output y € B. It is allowed for a value in the codomain to have no arrows 
hitting it, or multiple arrows hitting it. 


Graphs of Real-Valued Functions. A relation G C R? drawn in the zy-plane is 
the graph of a function g : A > R iff for each xp € A, the vertical line x = xo 
intersects G in exactly one point, and for each xo ¢ A, the vertical line x = xo 
does not intersect G. Each horizontal line y = yo may intersect G' once, never, or 
multiple times. 


Three Conditions to be a Function. We often define functions g : A > B by giving 
a formula specifying g(x) for each x € A. To be sure we do have a well-defined 
function, we must check three things: g(x) is in the codomain B for all x € A; for 
each x € A there exists at least one associated output g(x); and for all z1, £2 € A, 
if xı = x2, then g(x1) = g(x2). The last condition can be stated more precisely 
using the graph G of g: we must check for all x, y, z, if (x,y) € G and (a,z) € G 
then y = z. 


Exercises 


1. 


Formally describe the functions gı and g3 pictured in Figure 5.1 as ordered triples. 
Also give informal definitions of these functions using function notation. 


Suppose we reverse all the arrows in Figure 5.1. Which of the resulting diagrams 
represents functions from B into A? Which of these diagrams represents functions 
from some domain C into A? 

Which of the arrow diagrams in Figure 5.3 represent functions from A to B? 
Explain. 

Let A = {1,2,3,4,5,6,7}. Which relations are graphs of functions from A to 
A? Explain. (a) Ry = {(i,i +1) : i = 1,2,3,...,6}. (b) Ro = Rı U {(7,1)}. 
(c) Rg = A x {5}. (d) R4 = {5} x A. (e) Rs = {(a,8 — a) : a € A}. (£) Re = Ia. 
For each relation G; shown in Figure 5.2, find G;[R] and G7 +[R]. 


(a) Which of the graphs in Figure 5.4 are graphs of functions from R to R? 
Explain. (b) Repeat (a), replacing each graph in the figure by its inverse. 
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FIGURE 5.3 
More arrow diagrams. 


10. 


11. 


Let A = {1,2,3} and B = {4,5,6,7}. For which of the following choices 
of the graph G is (A,B,G) a function? Explain briefly. (a) {(2,4), (3, 7)}. 
(b) {(1,6), (2,4), (3,4)}. (c) {(e,@+4) : € A}. (d) {(æ, 22) : z € A}. (e) Ax B. 
Draw arrow diagrams for all functions f : {1,2,3} —> {a,b}. 

For fixed n € Z>o, how many functions f :{1,2,...,n} — {a,b} are there? 
Draw graphs for all functions g : {1,4} > {2,3,5}. 

) For fixed m € Zo, how many functions g : {1,4} > {1,2,...,m} are there? 
Which of the following relations are graphs of functions? For those relations that 
are graphs of functions, determine the domain and smallest possible codomain 
for a function with that graph. 


(a 
(b 
(a 
(b 


a) {x,y ERxR: x= y}. 
b) {(z,y) ER x R: 2? =y}. 
c) {(z,y) ER x R: 22 =y} 
d) {(z,y) ER x R: z= y?}. 
e) {(z,y) ER x R: 2? = y}. 
f) {(z,y) ER x R: z? = y3}. 
g) {(x,y) ER x R: 22 = y?}. 


Which of the following relations are graphs of functions? For those relations that 
are graphs of functions, determine the domain and smallest possible codomain 
for a function with that graph. 


a) {(x, y) ER x R:ay= 5}. 
b) {(z,y) ERxR:y= eae 
c) {(z,y) ER xR: y= ear}, 
d) R x {4}. 
e) {0} x R. 
f)QxZ. 
g) (Q x {1}) U ((R— Q) x {0}) 
h) 4. 
i) {((z,y),z) E€ (R x R) xR: z = zy}. 
j) {((@,y), 2) € (R x R) x R : zy = z}. 
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More graphs in R?. 


12: 


13. 


14. 


15. 
16. 


17. 


Let X = {1,2,3} and define h : (P(X) — {0}) + X by letting h(A) be the least 
element of A for every nonempty A C X. Give a formal description of the function 
h as an ordered triple. 

Let Y = {2, {3}, {2, {3}}} and define f : Y > P(Y) by f(y) = {y} for ally E Y. 
Describe the graph of f as a set of ordered pairs. 


Which of the following are well-defined functions? Explain. (Assume m,n, a,b, c,d € 
Z with n,b,d > 0.) 

a) f: Q> Z given by f(m/n) =m+n. 

b) g : Q > Q given by g(m/n) = (m + 2n)/n. 

c) h: R > {0,1,...,9} given by h(x) = the second digit after the decimal point 
in the base-10 representation of x. 

d) m : Q x Q > Q given by m((a/b, c/d)) = (ac)/(bd). 

e) p : Q x Q —> Q given by p((a/b,c/d)) = (a + c)/(b + d). 

f) s : Q x Q > Q given by s((a/b, c/d)) = (ad + bc) /(bd). 

Prove that for all sets A and B, (A, B, Ø) is a function iff A = 0. 


Enlarging the Codomain. Prove that for all sets A, B,C, if B C C and (A, B, F) 
is a function, then (A, C, F) is a function. 

Prove or disprove: for all A, B, G, if (A, B, G) is a function, then (B, A, G7!) is a 
function. 
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5.5 Examples of Functions and Function Equality 


This section defines some basic examples of functions, such as constant functions, identity 
functions, inclusion functions, characteristic functions, and arithmetic functions. We also 
introduce operations (called pointwise sum, difference, product, and quotient) that can be 
used to construct new functions from old functions. We then develop a proof template 
for showing that two functions are equal. We use this template to prove some algebraic 
properties of the pointwise operations on functions, analogous to corresponding properties 
for addition and multiplication of real numbers. 


Constant Functions, Inclusion Functions, and Identity Functions 


5.44. Definition: Constant Functions. For any sets A and B and any fixed object c € B, 
the function f : A —> B defined by f(x) = c for all x € A is called a constant function with 
value c. 


The graph of the constant function defined above is G = {(x,c): x € A} = Ax {c}. Let 
us prove that f = (A, B,G) really is a function by checking the three conditions from §5.4. 
First, every member of G is an ordered pair (a,c) with x € A and c € B, so Œ C Ax B. 
Second, for a fixed x € A, there exists y with (x,y) € G, namely y = c. Third, fix x,y,z, 
assume (x,y) € G and (a, z) € G, and prove y = z. By definition of G, we must have y = c 
and z = c, so y = z follows. 


5.45. Definition: Inclusion Functions. Let A and B be two sets such that A C B. 
Define a function j = ja,B : A > B by setting ja B(x) = x for all x € A. We call j4,p the 
inclusion function mapping A into B. 


The graph of the inclusion function defined above is G = {(x,x) : x € A} = I4, the 
identity relation on A. Let us prove that jA Bg = (A, B, G) really is a function. First, every 
member of G is an ordered pair (x,x) with zx € A and x € B (since A C B), so that 
G C Ax B. Second, for a fixed x € A, there exists y with (x,y) € G, namely y = x. Third, 
fix x,y,z, assume (x,y) € G and (x,z) € G, and prove y = z. By definition of G, we must 
have y = x and z = 7z, so y = z follows. 


5.46. Definition: Identity Functions. Given any set A, define a function Id4 : A > A 
by setting Ida (x) = a for all x € A. We call Id, the identity function on the set A. 


Note that the identity function Id, is the same as the inclusion function jĘ,4, so Id, is a 
function. On the other hand, if A is a proper subset of B, then Id4 is not the same function 
as the inclusion j4,g. To check this carefully, note that Id4 is the ordered triple (A, A, G), 
where G = {(x,x) : x € A}, whereas j4,p is the ordered triple (A, B, G). Since A # B, 
these ordered triples are not equal. Less formally, Id4 and j4,5 are unequal functions since 
their codomains are different (even though their domains and graphs agree). We return to 
this point later when we discuss function equality. 


5.47. Example. Figure 5.5 gives arrow diagrams for Id4 and j4,p when A = {2,4,5} and 
B = {1,2,3,4,5}. 


Arithmetic Functions and Pointwise Operations on Functions 


We are all familiar with the arithmetic operations on real numbers: addition, subtraction, 
multiplication, and division. We do not attempt to define these operations, instead viewing 
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FIGURE 5.5 
Contrasting identity functions and inclusion functions. 


them as undefined concepts that obey certain axioms (see §2.1 and §8.1). However, now 
that the language of functions is available, we can at least describe more precisely what 
kind of objects these undefined operations are. 

For example, consider the undefined operation of adding two real numbers. For any two 
real numbers x and y, we can apply the addition operation to these two numbers to obtain 
a unique output x + y, called the sum of x and y. More formally, addition transforms the 
ordered pair of inputs (x,y) into the real output x + y. So we could describe addition as 
a function ADD : R x R > R, where ADD((z,y)) is defined to be the sum of x and y. We 
would almost always write x + y as a shorthand for ADD((z,y)). Similarly, multiplication 
is a function MULT : R x R > R, where MULT((x,y)) = x - y is the product of x and 
y. Subtraction and division could be defined similarly, provided we use R x Ryo as the 
domain of the division function (since we cannot divide by zero). Alternatively, we could 
first introduce an additive inverse function AINV : R > R, denoted AINV(z) = —z for z € R, 
and then define subtraction via x — y = «+ (—y) for x,y € R. Similarly, we could introduce 
a multiplicative inverse function MINV : Ryo —> R, denoted MINV(z) = z7! for z € Ryo, 
and then define division via x/y = x- (y~+) for x,y € R with y £ 0. 

We can use the addition operation on real numbers to create an addition operation 
acting on functions, as follows. 


5.48. Definition: Pointwise Sum of Functions. Suppose A is a set and f : A > R, 
g : A —> R are two real-valued functions with domain A. Define the pointwise sum 


f+g:A—R|by letting | (f + g)(x) = f(x) + g(x) | for all x € A. 


The graph of f +g is the set {(x, f(x) + g(a) : x € A)}. You can check that f +g is a 
function, using the fact that f and g are known to be functions. 


5.49. Example. If f : R — R is the exponential function and g : R — R is the cosine 
function, then f +g : R > R is given by (f + g)(x) = e? + cosa for all real x. For another 
example, the function h : R > R defined by h(x) = x +3 is the pointwise sum of the identity 
function on R and the constant function with value 3. 


Note that the domain A need not be a subset of R to form pointwise sums. It is required 
that the functions being combined share the same domain. We can define more pointwise 
operations as follows. 


5.50. Definition: More Pointwise Operations. Given functions f,g : A —> R, we define 
the following operations (called pointwise product, pointwise additive inverse, pointwise 
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difference, pointwise reciprocal, and pointwise quotient). For parts (d) and (e), we assume 


g(x) £0 for all x € A. 


(a) Define | f -g : A >R] by setting | (f -g)(x) = f(x) - g(2) | for x in A. 
(b) Define |—g : A > R] by setting | (—g)(x) = —(g(z)) | for x in A. 

(c) Define | f — g : A >R] by setting | (f — g)(x) = f(x) — g(x) | for x in A. 
(d) Define |1/g : A >R] by setting | (1/9)(x) = 1/g(z) | for x in A. 

(e) Define | f/g : A >R by setting | (f/g)(x) = f(x)/g(x) | for x in A. 


Take care not to confuse the pointwise product f -g (also denoted fg) with the compo- 
sition of functions f o g (to be introduced later). Similarly, we write 1/g for the pointwise 
reciprocal to avoid confusion with the inverse function g~! (to be studied later). Also, be 
very careful not to confuse a function such as f + g with (f + g)(x), which is the output of 
that function at input x. In particular, (—g)(x) is the output of the function —g at input 
x; this output is defined to be —(g(x)), the additive inverse of the real number g(x). 


5.51. Example. The function q : R > R given by q(x) = 5x is the pointwise product of 
the constant function with value 5 and the identity function on R. The squaring function 
s:R—R given by s(x) = 2? is the pointwise product s = Idg Idg, since (Idg Idg) (x) 
(Idp(x)) - Idp(x)) 2 (x) for alla € R. 


=“£-n=14*=8 

By starting with the identity function and constant functions on R and repeatedly 
iterating the pointwise sum and product constructions, we can build the polynomial func- 
tions on R. For example, Idr Idr Idg +2 Idg —5 is the polynomial p : R — R given by 
p(x) = x? + 2x — 5 for x € R. The general definition is as follows. 


5.52. Definition: Polynomials. A function p : R —> R is a polynomial (in one real 
variable) iff there exist an integer n > 0 and constants ao,...,@, E R such that for all 
zeER, 


n 
p(x) = ao + a2 + aga? +--+ + ans” = aia, 
i=0 


Equality of Functions 


Recall that a function f : A — B was formally defined as an ordered triple f = (A, B, G), 
where G = {(x, f(x)) : x € A} is the graph of f. Given another function h = (A’, B’, G"), 
when are the functions f and h equal? By the fundamental property of ordered triples, we 
see that f = h iff A= A’ and B = B’ and G = C. In other words, two functions are equal 
iff they have the same domain, the same codomain, and the same graph. Suppose we know 
that A = A’ and B = B’. You can check that the graph G = {(z, f(x)) : x € A} equals 
the graph G’ = {(x,h(x)) : x € A} iff for all x € A, f(x) = h(x). Thus we can say that 
two functions are equal iff they have the same domain, the same codomain, and the same 
values on their common domain. This leads to the following proof template for proving the 
equality of two functions. (Compare this to earlier proof templates for proving equality of 
sets or equality of numbers.) 


5.53. Proof Template for Proving Function Equality. Let functions f : A — B and 
g : A —> B have the same domain A and the same codomain B. To prove f = g: 
Fix an arbitrary object x € A. Use the definitions of f and g to prove f(x) = g(x). 
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The last step (proving equality of f(x) and g(x), which are objects in the codomain B) 
is often achieved by a chain proof, writing down a chain of known equalities linking the 
left side f(x) to the right side g(x). We now illustrate this proof template by proving some 
algebraic laws satisfied by pointwise operations on functions. 

5.54. Theorem on Pointwise Operations. Suppose A is any set and f,g,h: A > R 
are functions. Let 04 and 14 denote the constant functions from A to R with values 0 and 


1 (respectively). The following function equalities hold: 

(a) Commutativity: f+g=gt+fandf-g=q-f. 

(b) Associativity: (f +g) +h=ft+(g+h) and (f-g)-h=f-(g-h). 
(c) Identity: f +04 =f=04+fand f-la=f=lay-f. 
( 
I 


d) Inverses: f + (—f) =04 =(-f) + f. 

f g(x) £0 for all x € A, then g- (1/g) = 14 = (1/9) - g. 

(e) Distributive Laws: f -(g+h) =(f-g)+(f-h) and (f+g)-h=(f-h)+(g-h). 
Proof. We prove a few identities here, leaving the others as exercises. To prove the first 
commutative law in (a), first note that f +g and g + f are both functions with domain A 
and codomain R (by definition of pointwise sum). Following the proof template for function 
equality, fix an arbitrary object x € A, and prove (f + g)(x) = (g + f)(x). We compute 
(f +9)(x) = f(x) +9(x) = g(x) + f(x) = (g + f)(x), where the first step uses the definition 
of f + g, the second step uses the known commutativity of addition of real numbers, and 
the third step uses the definition of g + f. 

Next we prove f-14 = f. Both f-14 and f have domain A and codomain R. Fix x € A. 
We know (f - 1a)(x) = f(a) - 14(x) = f(x)- 1 = f(x), using the definition of pointwise 
product, then the definition of the constant function 14, and then the multiplicative identity 
axiom for R. 

Finally, we prove f- (g+h) = (f-g)+(f-h). Repeated use of the definitions shows that 
gth, f-(gt+h), f-g, f-h, and (f-g)+(f-h) all have domain A and codomain R. Fix 
x € A. Compute 

[f-(gth)|(z) = -(g+h)(x) (by def. of pointwise product) 
-[g(xz) + h(a)] (by def. of pointwise sum) 
( (x)) + (f(a)-h(a)) (by the distributive law in R) 
= (f-g)(x)+(f-h)(x) (by def. of pointwise product) 

(f-g)+(f-hA)| (2) (by def. of pointwise sum). 


So f-(g+h) =(f-9)+(f-hA), as needed. 


In the previous computation, be sure you understand why the input x does or does not 
appear at various places in each expression. For example, the initial expression [f-(g+h)](«) 
is the output of the function f-(g+h) at input x, while the next expression f(x). (g+h)(x) 
is the product of the two real numbers f(x) and (g + h)(x). These two quantities are equal 
by the definition of the pointwise product of the functions f and g +h. 


5.55. Example: Changing the Codomain. Suppose f : A — B is a function with 
graph G = {(a, f(x)) : x € A}. For any set C such that B C C, we can create a new 
function f* : A > C by setting f*(x) = f(x) for all x € A. Formally, f = (A, B,G) and 
f* =(A,C,G). We have constructed f* from f by enlarging the codomain. More generally, 
the new codomain C can be any set such that G C Ax C. This means that for all x € A, we 
must have f(x) € C. This condition holds iff G[A] (the image of A under the relation G) is 
a subset of C. We must remember that when B 4 C, f and f* are not the same function, 
although some texts use the same letter f to denote both functions. 
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Characteristic Functions (Optional) 
We end this section with another class of examples of functions that are often used in 
probability. 


5.56. Definition: Characteristic Functions. Let X be any set, and let A be a fixed 
subset of X. Define xa : X > R by setting xa(x) = 1 for all  € A, and xa(x) = 0 for all 
x ¢ A. We call xa the characteristic function of A in X. 


The graph of xa is (A x {1}) U ((X — A) x {0}). You can prove that x4 is a function. 
Intuitively, xa is an indicator function that tells us which inputs x belong to A and which 
do not. For example, taking X = R, we have 


xo(0.73) = xq(0) = xe(-11/3)=1,  xol(T) = xo(e) = xa(V3) = 0. 


When using characteristic functions, it is critical to specify what the intended domain X 
is, since this is not part of the notation y 4. 

We can combine characteristic functions with the same domain using the pointwise 
operations introduced earlier. We can then verify various identities that connect operations 
on sets to pointwise operations on functions. The next theorem gives a sample of such 
identities. 


5.57. Theorem on Characteristic Functions. For all sets X and all A,B C X, the 
following function equalities hold. 


a) XAnB = XA ` XB- 

b) xx-a = 1- Xa. 

c) Xø = 0x (where 0x : X > R is the constant function with value 0). 
d) xx = 1x, where 1x : X — R is the constant function with value 1. 
e) XAUB = XA + XB — XANB- 

f) If AN B = Q, then yaus = XA + XB- 


en ee ee 


Proof. We prove part (a) and leave the others as exercises. First note that yang and XAXB 
are both functions with domain X and codomain R. Fix x € X, and consider four cases. 


). 
Case 2. Assume x € A and x ¢ B. Then xang(z) =0=1-0= xa(x)x B(x) = (xANB)(2). 
Case 3. Assume x ¢ A and x € B. Then y4na (xz) =0=0-1= xa(x)x B(x) = (xaxsB)(z). 
Case 4. Assume x ¢ A and x ¢ B. Then xang(z) =0=0-0= xa(x)x B(x) = (xANB)(2). 


Thus, Xang (x) = (x4XB)(2) in all cases. So XanB = XA ` XB- 
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1. Constants, Inclusions, and Identity Functions. For fixed c € B, the function 
f:A-— B given by f(x) = c for all x € A is the constant function on A with 
value c. For sets A C B, the function jA, B : A > B given by ja, B(x) = & for all 
x € Ais the inclusion function mapping A into B. For any set A, the function 
Id; : A > A given by Id4(x) = v for all x € A is the identity function on A. 


2. Pointwise Operations on Real- Valued Functions. For functions f,g : A > R, we 
can define new functions f +g, f-g, —g, f—g, and (when g(x) # 0 for all x € A) 
f/g by performing the indicated arithmetic operation one input at a time. For 
instance, (f + g)(x) = f(x) + g(x) for all x € A, and (f - g)(x) = f(x)g(x) 
for all x € A. Pointwise operations on functions satisfy algebraic laws such as 
commutativity, associativity, and the distributive laws. 
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Function Equality. Two functions are equal iff they have the same domain, the 
same codomain, and the same graph (i.e., the same value at each point in the 
common domain). To prove that f : A— B and g : A— B are equal functions, 
fix x € A and use known facts to prove f(x) = g(x). Changing the codomain of 
a function produces a different function with the same domain and graph. 
Characteristic Functions. For A C X, define xa : X > R by xalx) = 1 for 
x € A and ya(x) = 0 for x € X — A. For A,B C X, we have XAnB = XA ` XB; 
Xx-a = l- xa, and XauBp = XA + XB — XAnB- 


Exercises 


I; 


Draw arrow diagrams for the following functions. (a) Idj_2,9)qz (b) j{0,3},{0,1,2,3,4} 
(c) jzo (d) xa : X > R, where X = {1,2,...,10} and A consists of the prime 
numbers in X. 


Draw graphs in the xy-plane for the following functions. Mark the domain on the 
x-axis and the codomain on the y-axis. (a) Id}-æ,3juz (b) Idj—co,3jnz_ (c) zr 
(d) x(-3,-1Jua,3}: R>R_ (e) xq :R>R (f) X{x:z3+2r2—132+10=0} : R > R. 
Consider the following six functions: 


fi:R—>R given by filz) = 
fo: R= [0, 20) given by fə(x) = |z|; 
fs : (0,00) > [0,00) given by s(x) = |x; 
fa: ROR given by fa(x) 

fs : [0,00) + [0,co) given by fs(x) 

fe: R= [0,c0) given by felx) 


(a) Which pairs of functions in this list are equal? 

(b) Which pairs of functions have the same graph? 

(c) Sketch the graph of each function, indicating the domain and codomain of 
each. 

Define f,g,h,k : R — R by setting f(x) = x°, g(x) = x—4, h(x) = Tz, and k(x) = 
2” for x € R. Find simplified formulas for the following functions constructed 
using pointwise operations. (a) f+g (b) fh (c) —2g (d) kk (e) fk(h— g) 
(£) ggh—7f (g) f/k. 

Suppose f,g : R > R satisfy f(1) = 

g(x) # 0 for all x € R. Compute: (a 
(d) (f/9)(3) (e) (F Idr +g9)(1). 

Let X = [0,4], A = [0, 2], B = (1,3), C = (2, 4]. Draw the graphs of the following 
functions from X to R. (a) xa (b) 3xs (c) 2xa +5xce (d) 3x4 — xB + 2xc. 

Let f,g,h : A > R be functions. Prove the following identities from Theorem 5.54, 
explaining every step with a definition or an algebraic fact about real numbers. 

(a) (f+g)th=f+(gth). (b) f+(-f)=0a=(-A+f. Of g=g-f. 
(d) (f-9)-h=f--h). (e) ft+0a=f=Oatf. (£) (ft+g9)-h=(F-A)+(g-A). 
(a) Suppose f,g : A > R. Prove carefully that f +g, f-g, and —g are indeed 
functions. (b) Let Z = {x € A: g(x) = 0}. Show that the generalized pointwise 
quotient f/g : (A — Z) > R, defined by (f/g)(x) = f(x)/g(x) for alla € A — Z, 
is a function. 


5, g(1) = 2, f(3) = —4, g(3) = —3, and 
FD) (b) (Ff — 9)(3) (©) (F9)() 


Suppose X is a set and A C X. Prove that the characteristic function xa is 
indeed a function. 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


Compute the following quantities. All characteristic functions mentioned have 
domain and codomain R. (a) xọ(1.107) (b) xrł-ọ(v2) (c) xz(111111/37) 


(4) Xole" = 7°) (e) Eita Xira T) (6) Tear Xiri 2) 
Let X be a set and let A, B be subsets of X. Prove parts (b) through (f) of 
Theorem 5.57. 


Prove: for all sets X and all A,B,C C X, 


XAUBUC = XA + XB + XC — XANB — XANC — XBNC + XANBNC.- 


Suppose n € Zso and fi,..., fn : A — R are functions. Give recursive defini- 
tions of the pointwise sum )>;"_, f; and the pointwise product [];_, fi of these n 
functions. 

Suppose n € Zyo and Aj,..., An are subsets of X. 

(a) Prove by induction that y4yn...n4, = [joy XA: 

(b) Prove: if A; N A; = Ý for all i 4 j, then yayu...ua, = dope Xdi: 

Suppose X is a set, J Æ Ø is an index set, and for each i € I, A; is a subset of X. 
(a) Prove: for all  € X, xq,_, 4: (£) = min{xa, (x) : i € I}. 

(b) Prove: for all x € X, xy,_, 4: (£) = max{x4, (x): 7 € I}. 


Suppose G is a relation satisfying the uniqueness condition 


Va, Vy, Vz, (2, y) € GA (2,2) E€ G) > (y= 2). 


Prove that for all sets A and B, (A, B, G) is a function iff A = {x : dy, (x,y) € G} 
and G[A] € B. 


(a) Prove: for all f,g : R > R, if f and g are polynomials, then f +g is a 
polynomial. (b) Prove: for all f,g: R > R, if f and g are polynomials, then f -g 
is a polynomial. 

Even Functions. A function f : R > R is even iff for all x € R, f(-x) = f(z). 
a) Prove: for all f,g: R —> R, if f and g are even, then f + g is even. 

b) Prove: for all f,g: R > R, if f and g are even, then f -g is even. 

c) Prove: for all f : R > R, the function g : R > R, given by 

g(x) = [f (x) + f(—a)]/2 for x € R, is even. 

Odd Functions. A function f : R > R is odd iff for all x € R, f(-x) = — f(x). 
a) Prove: for all f,g: R > R, if f and g are odd, then f + g is odd. 

b) Prove: for all f,g : R > R, if f and g are odd, then f - g is even. 

c) If f is odd and g is even, what can you say about f +g and f- g? 


Prove: for all f : R —> R, there exist g,h : R — R such that g is even, h is odd, 
and f=gth. 

Linear Functions. A function f : R > R is linear iff for all c, x,y € R, 

fæ +y) = f(x) + f(y) and fler) = cf (x). 

(a) Prove: for all f,g: R > R, if f and g are linear, then f + g is linear. 

(b) Prove: for all f : R > R and all b € R, if f is linear, then bf is linear. 

(c) Prove or disprove: for all f,g : R > R, if f and g are linear, then f- g is linear. 
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5.6 Composition, Restriction, and Gluing 


This section investigates three more operations for building new functions from given func- 
tions: composition, restriction, and gluing. The composition operation combines functions 
f: A— B and g: B-C to get a new function go f : A —> C that acts by doing f and 
then doing g. The restriction operation takes one function f : A —> B and builds a new 
function by shrinking the domain of f to some subset of A. The gluing operation allows us 
to combine two or more functions (under certain conditions) to build a new function with a 
larger domain. For instance, we need gluing to define the absolute value function and other 
functions defined by cases. 


Composition of Functions 


5.58. Definition: Function Composition. Given functions f : A > Bandg: BOC, 
define the composition go f : A —> C by setting (go f)(x) = g(f(a)) for all xz € A. 


Note that the composition go f is only defined when the codomain of f equals the domain 
of g. In this case, the output of the composite function on an input x in A is found by first 
performing f on x to obtain an intermediate output f(x) in B, and then performing g on 
f(x) to obtain the final output g(f(x)) in C. The graph of go f is {(x, g(f(x))) : a € A}. 
It is routine to check that go f really is a function, using the assumption that f and g are 
functions. You can also verify that if F is the graph of f and G is the graph of g, then the 
composition of relations Go F is the graph of go f. 


5.59. Example: Composition via Arrow Diagrams. Let A = {1,2,3,4,5} and B = 
{a, b,c, d,e}. Figure 5.6 shows the arrow diagrams for functions f : A— B and g : B > B. 
For each x € A, we obtain (go f)(x) = g(f(x)) by following an f-arrow and then a g-arrow 
in succession. The resulting arrow diagram is shown at the bottom of the figure. The graph 
of go f : A > Bis {(1, c), (2, e), (3, d), (4, d), (5, a)}. In this example, the composite function 
f og is not defined, since the codomain of g (the set B) does not equal the domain of f 
(the set A). 


5.60. Example: Composition via Formulas. Let f : R > R be given by f(x) = x”, and 
let g: R > R be given by g(x) = x +2. In this case, both fog and go f are functions with 
domain R and codomain R. For each real x, we compute (g o f)(x) = g(f(x)) = g(x?) = 
x? + 2, whereas (f o g)(x) = f(g(x)) = f(x +2) = (x +2)? = x? + 4x + 4. In particular, 
(go f)Q) =3 49 = (f og)(1), so we conclude that go f # fog. This shows that function 
composition is not commutative in general. 

We can also combine function composition with pointwise operations. For instance, 
go f + fg is the function sending x € R to g(f(x)) + f(a)g(z) = £? +24 27(ax + 2). 
We have f o f = f- f, since for each x € R, (f o f)(z) = f(f(x)) = (2)? = xt and 
(f - f)(x) = f(a) f(x) = x? - x? = zt. On the other hand, g o g Æ g: g, since (go g)(x) = 
g(g(x)) = (x +2) +2 = x + 4 and (g- g)(x) = g(x)g(x) = (x + 2)", and these functions 
disagree at x = 1. 


5.61. Example. Let A = {1,2,3,4}, and let functions f,g,h: A— A have graphs 


Gs = {(1, 3), (2,4), (3, 1), (4, 2)}, 
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A 


FIGURE 5.6 
Arrow diagrams illustrating composition. 


You can check that go f and f og both have graph {(1, 1), (2,4), ( 
go f = fog. But goh # hog, since (for instance) (go h)(1) = g(h 
(ho g)(1) = h(g0)) = h(3) = 4. 


Although function composition is not commutative, the following algebraic laws do hold. 


3,3), (4,2)}, so that 
(1)) = g(2) = 2 and 


5.62. Theorem on Function Composition. 

(a) Associativity: For all f : A> B, g : B > C, and h : C > D, ho (go f)=(hog)of. 
(b) Identity: For all f : A> B, fold, = f = Idg of. 

(c) Right Distributive Laws: For all f : A —> R and all g, h : R —> R, (g+h)of = (gof)+(hof) 
and (g: h)o f = (go f): (ho f). 

Proof. We use the proof template for proving equality of functions (see page 241). For 
part (a), fix f : A> B, g : B —> C, and h : C —> D. We have functions (go f): A> C 
and (hog): B > D, so that the functions ho (go f) and (ho g) o f are both defined with 


domain A and codomain D. Now, fix x € A. Repeatedly using the definition of function 
composition, we find that 


[h o (g 0 F) (£) = h((g 0 f)(2)) = 


[(hog)o f(x) = (hog)(f(a)) = 
Thus, ho (go f) = (hog)o f, as needed. 

For part (b), fix f : A > B. Recall Id4 : A — A is defined by Id4(x) = x for all x € A. 
We now see that f o Id : A — B is defined and has the same domain and codomain as 
f:A—- B. Fix « € A, and compute (f oId,4)(x) = f(Ida(x)) = f(x). So fold, = f. The 
equality f = Idg of is proved similarly. 

Finally, we prove the first equality in part (c). Fix f : A —> R and g,h : R > R. Both 
(g+h)o f and (go f)+ (ho f) are functions from A to R. Fix x € A, and compute 


((g +h) o f(a) = (g + h) (2)) = g (2)) + hF (@)) 
= (go f)(x) + (ho f)(x) = [go F) + (ho f(a). 
So the functions (g + h)o f and (go f)+ (ho f) are equal. 


(g(F(2))); 
(9(f(2))). 
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In the exercises, we see that the left distributive law fo(g+h) = (fog) +(f oh) is not 
always true, though it does hold when f has a special property called linearity. 


Restriction of Functions 


Our next operation on functions, called restriction, shrinks the domain of a given function 
to a smaller set of inputs. 


5.63. Definition: Restriction. For any function f : A —> B and any set S C A, the 
restriction of f to S is the function f|s : S + B defined by f|s(x) = f(x) for all x € S. 


The graph of f|s is {(x, f(x)) : x € S}, which can be obtained by intersecting the graph 
of f with S x B. You can check that f|s really is a function. Note that f|s and f are 
different functions (except when S = A), since these functions have different domains. 


5.64. Example: Arrow Diagrams of Restricted Functions. Let A = {1,2,3,4,5}, 
B = {a,b,c,d,e}, and define f : A > B by f(1) = c, f(2) = c, f(3) = e, f(4) = b, 
and f(5) = e. Restricting f to the set S = {1,3,5} gives the function fly : S > B 
with fls(1) = c, fls(3) = e, and f|s(5) = e. If we restrict f to T = {2,4}, we get a 
function f|r : T + B with graph {(2,c), (4,b)}. Note f|4 = f, whereas f|g is the empty 
function mapping Ø into the codomain B. Arrow diagrams for f, fls, flr, and f|g are 
shown in Figure 5.7. Note that all of these restricted functions have the same codomain 
as the original function. We could obtain more new functions from these by enlarging or 
shrinking the codomain, as discussed in Example 5.55. 


FIGURE 5.7 
Arrow diagrams for restricted functions. 


5.65. Example: Graphs of Restricted Functions. Suppose g : R > R is the squaring 
function g(x) = z? for x € R. Figure 5.8 displays the graphs of g, 9|10,00); 9|(—00,0]» 9|(—2,2) 
9\{-1,0,1}, and glg. For each S C R, we obtain the graph of g|s from the graph of g by 
intersecting the full graph with the product set S x R. This means that we only keep those 
points (x, g(x)) in the original graph where the first coordinate x is in the set S. 
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FIGURE 5.8 
Graphs for restrictions of the squaring function. 


Gluing Functions Together (Optional) 


We now discuss the gluing operation, which is used to combine two or more functions to 
obtain a single new function with a larger domain. Before presenting the general definition, 
we consider two examples. 


5.66. Example. Consider the three functions f, g, h defined as follows: 


f :{1,2,3,4} > {a,b,c,d} with graph  {(1,a), (2,4), (3, d), (4, b)}; 
g: {1,3,5,7} > {a,b,c,d} with graph  {(1,a), (3, d), (5, ), (7, a)}; 
h: {0,1,5} > {a,b,c,d} with graph  {(0,d), (1, a), (5, d)}. 


Arrow diagrams for these functions appear in the top half of Figure 5.9. Consider the effect 
of merging (gluing together) the diagrams for f and g. The result is shown in the bottom-left 
panel of the figure. In this case, a new function results, namely 


glue(f,g) : {1,2,3,4,5,7} > {a,b,c,d} with graph {(1, a), (2, a), (3, d), (4, b), (5, b), (7, a)}. 
Similarly, combining the diagrams for f and h produces the new function 
glue(f,h) : {0,1,2,3,4,5} > {a,b,c,d} with graph {(0, d), (1, a), (2, a), (3, d), (4, b), (5, d)}. 
On the other hand, if we try to combine the diagrams for g and h, the new graph is 

{(0, d), (1, a), (3, d), (5, b), (5, d), (7, a)}, 


which is not the graph of a function. The trouble is that 5 is in the domain of both g and 
h, but g(5) = b # d = h(5). Thus, when we combine g and h, the input 5 is associated with 
two distinct outputs. 
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FIGURE 5.9 
Arrow diagrams illustrating gluing. 


5.67. Example. Consider the function f : (—oo,0] > R defined by f(x) = —x for x < 0 
and the function g : [0, o0) > R defined by g(x) = x for x > 0. Combining the graphs of f 
and g produces a new function h = glue( f, g) : R > R such that h(x) = —2x for x < 0, and 
h(x) = «x for x > 0. This function h is the familiar absolute value function from calculus. 

On the other hand, suppose we try to glue together the functions fı : R —> R and 
gı : R > R defined by fi (a) = —x and g(x) = x for all x € R. In this case, the union of the 
graphs of fı and gı is not the graph of a function. The trouble is that there exist elements 
x in the common domain of fı and gı such that fı(x) 4 gi(a). Indeed, this inequality holds 
for all x except for x = 0. Note that this difficulty did not arise when we glued together f 
and g, because these two functions agree on the common part of their domains (which is 
the one-point set {0}). 


Now we define the gluing operation for a family of functions. 


5.68. Definition: Gluing. Suppose J is an index set, and for each i € I we have a 
function fi : A; > B; with graph Gi. We call { f; : i € I} a gluable family iff for all i,j € I 
and all x € A; N Aj, fi(x) = f;(x). When this condition holds, we define a new function 
f: Uicr Ai > Use, Bi with graph G = U,-; Gi. We use the notation f = glue(f; : i € J). 


Given x € (Jier Ai, we can compute f(x) as follows. Choose any index i € I such that 
x € Ai; then f(x) = f;(x). Note that the output obtained does not depend on the choice of 
i; if we choose some other index j € I with x € A;, we know f;(x) = fi(a) since {f; : i € I} 
is assumed to be a gluable family. Thus f is a well-defined function. 

Here is a more formal proof that f is a function, based on checking that the graph 
G satisfies the three conditions in the definition of a function. Write A = Le 7 Ai and 
B=Uyje, Bi. First, for each i € I, we know G; C A; x Bi C A x B, so that G = Uje, Gi is 
also a subset of A x B. Second, given z € A, we know there exists i € I with x € A;. Since 
fi is a function, there exists y with (x,y) € Gi, and hence (x,y) € G. Third, fix x,y,z; 
assume (x,y) € G and (a, z) € G; prove y = z. By definition of G, there exist i,j € I with 
(x,y) € Gi and (x, z) € G;. The only ordered pair in G; with first coordinate x is (x, fi(x)), 
so y = fi(x). Similarly, the only ordered pair in G; with first coordinate x is (x, f;(x)), 
so z = f;(x). We also know x € A; (since (x,y) € G; C A; x Bi) and x € A; (since 
(x,z) € G; C A; x B;). By the gluable family condition, we deduce y = f;(x) = f;(x) = z, 
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as needed. So f = (A,B,G) is indeed a function. Conversely, it can be shown that if 
(A, B,G) is a function, then {f; : i € I} must be a gluable family (Exercise 25(b)). 

We remark that if the sets A; are pairwise disjoint (i.e., for alli A j in J, A; N A; = 9), 
then {f; : i € I} is automatically a gluable family. 


5.69. Example: Piecewise Functions. Define a function f : R —> R by setting 


sing for x <0; 

x forO <2 <1; 
ee 1 for 1 < x < 3; 

e7’ for x > 3. 


Formally, we can build f by gluing together the following four functions: 


fi: (—œ,0]—>R given by fi (x) = sina; 
: [0,1] > R given by falx) 
fs : [1,3] > R given by f3(a) = 1; 
: (3,00) > R given by fu(a) =e 


For alli Æ j, fi and fj agree on the overlapping part of their domains. Thus, we can 
construct the overall function f = glue( fı, fo, fs, f4) by taking the union of the graphs of 
the fi- 


Section Summary 


1. Composition of Functions. Given f : A > B and g : B > C, the composition 
go f : A — C is defined by (go f)(x) = g(f(a)) for all a € A. To form go f, the 
codomain of f must equal the domain of g. Function composition is associative 
(meaning ho (go f) = (hog) o f, when both sides are defined) and satisfies 
the identity property f oId4 = f = Idgof. But function composition is not 
commutative: fog Æ go f for most choices of f and g. 


2. Restriction of a Function. Given f : A> B and any subset S C A, the restriction 
fls : S > B is defined by f|s(x) = f(a) for all x € S. The graph of f|s is the 
intersection of the graph of f with S x B. 


3. Gluing Functions. Suppose for each 7 in an index set J, we are given a function 
fi: A; > B; with graph G;. These functions form a gluable family iff for all 
i,j € I and all x € A; N Aj, fi(x) = f;(x). In this case, we get a new function 
f = glue( f; : i € I) with domain U;-; Ai, codomain U;-; Bi, and graph U;<; Gi. 
For x in the domain of f, we have f(x) = f;(x) for any choice of i € I such that 
x € Ai. When the sets A; are pairwise disjoint, {f; : i € I} is always a gluable 
family. 


Exercises 


1. In Example 5.61, compute f oh and ho f. Are these functions equal? 
2. Let A = {1,2,3,4} and B = {v,w, x,y,z}. 
Define f : A > B by f(1) = w, f(2) = z, f(3) =y, f(4) = w. 
Define g : B > A by glv) = 4, g(w) = 3, gly) = 1, g(x) = 2, g(z) = 4. 
Let h : A —> A have graph {(1, 4), (2, 4), (3, 1), (4,3)}. 
Let k : B > B have graph {(v, z), (z, y), (y, v), (w, x), (x, w)}. 
Draw arrow diagrams for the following functions (when defined): (a) f, g, h, k 


252 


15. 


16. 
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(b) fof (c) fog (d) foh (e) fok (f)gof (g)gog (h)goh (i) gok 
G)hof (k)hog ()hoh (m)hok (n)kof (o)kog (p)koh (q) kok. 
Define functions f, g, h,k : R > R by setting f(x) = z?, g(x) = x — 3, h(x) = 2”, 
and k(x) = 5 for all z € R. Find simplified formulas for the following functions. 
Take care not to confuse compositions like f o g with pointwise products like fg. 
(a) fog (b) gof (c)fg (gf (e) foh (f)hof (g) gok (h)kog 

(i) ko(f+g) () (kof)+(kog) (k) ho(f+g) (1) (ho f)+ (hog) 

(m) (ho f)(hog) (n) (f+g)oh (0) (foh)+(goh) (p)(foh)(goh). 
Define functions f : Ryg Ryo by f(x) = vz, g : Rso > R by g(x) = Ing, 
h: E E a a a E 


ilz) = 

a) For re p,q € {f,g,h,k, 7} is the composition po q defined? 
b) Check that h o g = Idp,, and g o h = Idr. 

c) Show that klp,, ° f =J. 


) 
a) Prove: for all sets A, B, C, if AC BC C, then jg,c ° ja,B = jA,cC- 
(b) Prove: for all sche A aid E; if A C B, then jap = Idg |a. 


Let A be a set and g : A — A a function. For n > 0, we recursively de- 
fine the powers g9? : A — A as follows: g? = Id4, and g” = g™ log 
for n > 1. Let A = {1,2,3,4,5,6,7,8}, and let g : A —> A have graph 
{(1, 4), (4, 7), (7, 2), (2, 5), (5, 1), (3, 8), (8, 6), (6, 3)}- 

(a) domnie the functions g” for O0 < n < 5. (b) Does there exist n > 0 such 
that g” = Id4? If so, what is the smallest such n? If not, why not? 


( 
( 
( 
( 


Repeat Exercise 6 for g : A > A with graph 


{(2, 6), (6, 4), (4, 8), (8, 2), (1, 3), (3, 2), (5, 8), (7, 7)}- 


Repeat Exercise 6 for g : A > A defined by g(x) = x +1 for 1 < x < 7, and 
g(8)=1. 

Let A = {1,2,...,10}. Find functions g : A —> A such that: (a) g # Id4 but 
g? = Ida; (b) g° = Id4 but g? A Id, for 0 < j < 10; (c) g?! = Ida but gi Æ Ida 
for 0 < j < 21; (d) gi # Idą for all j > 0. 

Supply a reason for each equality in (5.1) and (5.2). 

(a) Prove: for all functions f : A > B, Idgof = f. 

(b) Prove: for all f : A >R and all g,h : R > R, (g-h)o f = (go f) (hof). 
(a) Give an example of f,g,h : R —> R where ho (f +g) £ (ho f) + (hog). 

(b) Give an example of f,g,h :R —> R where ho(f-g) 4 (ho f)- (hog). 

Let f,g,h:R— R be functions. Suppose h has the following linearity property: 
h(a +y) = h(x) + h(y) for all x,y € R. Prove that ho(f+g) = (ho f)+ (hog). 
Let f,g,h : R — R be functions. Find a condition on h that ensures that ho (f - 
g) = (ho f)- (hog) for all f,g : R > R. Prove your answer. 

Let f : R > R be defined by f(x) = sin x. Draw graphs of the following restric- 
tions of f: (a) f\[-2/2,n/2] (b) flfo,amy (©) Fl-7274) (d) Fl 0,0/6,/4,0/3,0/2,30/4,0} 
(e) flo (f) flikr/2: kez} 

Let f : R —> R be defined by f(x) = |x|. Draw graphs of the following restrictions 
of f: (a) F\fo,3] (b) f\(—3,-1up1,2) (c) flio} (d) fiz (e) flr-z (£) flo 
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17. Consider the function g : {1,2,3,...,8} > {v,w, x,y, z} with graph 
{(1, w), (2, z), (3, y), (4, w), (5, x), (6, £), (7, y), (8, x)}- 


Draw arrow diagrams for: (a) g (b) gl{1,3,5,73 (©) 913,45} (d) glysy (e) glo 
(f) gliaca: gla)=r} (g) gliaca: gla)=v}: 

18. Let A = [0,2] and B = [2, 00). Which of the following pairs of functions f : A > R 
and g : B — R are gluable? (a) f(x) = 2x, g(x) = 4. (b) f(x) = x?, g(x) = 2z. 
(c) f(x) = 2°, g(x) = z. (d) f(z) = e?, g(x) = 2/2. (e) f(z) = vz, 
g(x) = 2/V2. (£) f(x) = |z|, g(x) = 3- x. 

19. Consider the following four functions: 
f : {1,2,3} > Z given by f(1) = 4, f(2) = 0, f(3)= 4 
g : {0,2,4} > Z given by g(0) = 2, g(2) = 2, g(4) = 2; 
h : {0,1,2} > Z given by h(0) = 2, h(1) = 4, h(2) = 2; 
k : {0,1,3} > Z given by k(0) = 2, k(1) = 4, k(3) = 3. 
Which of the following pairs of functions are gluable? For those that are, draw 
an arrow diagram illustrating the glued function. (a) f and g; (b) f and h; (c) f 
and k; (d) g and h; (e) g and k; (f) h and k; (g) f and f. 

20. Assume f : X > Y and g : Y —> Z are functions and S C X. Prove that 
(g0 f)ls = g0 (fls). 

21. Assume f: A> Y and g : B —> Y are gluable functions and S C AU B. Prove 
that fı = flans and gı = g|Bns are gluable functions, and that glue(f1, 91) = 
glue(f, g)\s- 


22. Let X be a set. For fixed A C X, show how the characteristic function x4 can 
be built from the constant functions 0x and 1x on X by using restriction and 
gluing operations. 

23. Suppose f : X > Y is a function and S C T C X. (a) Prove that (f|r)|s = fls. 
(b) Prove that f|x = f. (c) Prove that flo = (0,Y,0). 

24. Suppose f,g : A — R are functions and S C A. Prove the following facts relating 
restriction to pointwise operations: (a) (f + g)|s = (fls) + (gls); (b) (f9)ls = 
(fls)(gls); (c) the restriction of a constant function on A with value co is the 
constant function on S with value co. 


25. Suppose that for alli € I, fi : A; > B; is a function with graph G;. (a) Show that 
{fı : i € I} is a gluable family iff for all i,j € I, fi j- (b) Show 
that if U,-; Gi is the graph of a function, then {f; : i € I} is a gluable family. 

26. Suppose { f; : i € I} is a gluable family, where f; : A; > B; for each i € I, and 
let h = glue(f; : i € I). (a) ;. (b) Prove: for any 
g : Vier Ai > Vier Bis i 

27. Suppose that h : A —> B is any function, and {A; : i € I} is a family of sets with 
A= User i:i €I} is a gluable family such 
that glue(h; :i € I) =h. 


tel 


i for alla € J, then g = h. 
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5.7 Direct Images and Preimages 


In earlier sections, we discussed the image of a set A under a relation R or the inverse 
relation R~!. Intuitively, R[A] is the set of all outputs that are related under R to at least 
one input in the set A. On the other hand, R~![B] is the set of all inputs that are related 
under R to at least one output in the set B. Here we introduce similar constructions where 
the relation R is replaced by a function f. 


Direct Images and Preimages 


5.70. Definition: Direct Image of a Set. Given a function f : X > Y andaset AC X, 
the direct image of A under f is f[A] = {f(x) : x € A}. Equivalently, for all objects y, 


y € FIA] | if | 3x € A,y = f(a). 


me 


Intuitively, f [|A] is the set of all outputs we get by applying f to every input in the set 
A. If G is the graph of f, we could rephrase this definition by writing f[A] = G[A] = {y : 
da € A, (x,y) € G}. Thus, the direct image of a set under a function is a special case of 
the image of a set under a relation. In fact, this formulation is a little more general since 
we could apply it to any set A (not just subsets of the domain X). However, when dealing 
with functions, it is sometimes easier to use the version of the definition phrased in terms 
of function notation. 


5.71. Definition: Preimage of a Set. Given a function f : X —> Y anda set BC Y, 
the preimage of B under f is f~'[B] = {x € X : f(x) € B}. Equivalently, for all objects z, 
x € f—*[B]| if |x € X and f(x) € B|. The set f~'[B] is also called the inverse image of 
B under f. 


Intuitively, the preimage f~'[B] is the set of all inputs in the domain of f that map to 
an output in the set B when we apply f. As before, if f has graph G, we could have also 
defined f~'[B] = G~*[B], which is the image of B under the inverse of the relation G. 


5.72. Warning. The notation f~'[B] does not mean that the function f has an inverse 
function f—!. (We discuss inverse functions in §5.9.) The preimage construction defined 
here can be applied to any function, not just those functions that have inverses. We also 
call your attention to our use of square brackets for images of sets under relations, direct 
images of sets under functions, and preimages of sets under functions. We use brackets 
rather than round parentheses to avoid confusion with function value notation, as shown in 
Example 5.74 below. 


5.73. Example. Let f and g be the functions shown earlier in Figure 5.6. Recall f has 
graph {(1, b), (2, e), (3, c), (4,c), (5, a)}, and g has graph {(a, a), (b,c), (c, d), (d, b), (e, e)}. We 
compute H3, 5} 7 {c, a}, f He, a}] = {3, 4, 5}, f Ho, d,e}] E= {1, 2}, FHL, 2}] = {b, e}. 
Moreover, g[{b, c, d}] = {b,c, d} = g~+[{b, c, d}], g[{c}] = {d}, g7"[{a, d,e}] = {a,c,e}, and 
g{0] = 0 = g7 + [0]. Finally, (g o f)[{3, 4, 5}] = {d,a}, (go F) HO} = 0, and 


(go PIT Ha, dH] = (9° FSH] = {a}. 
5.74. Example. Let X = {4,5,6,{4}, {4,5,6},0}, and define a function f : X —> X by 
setting f(4) = {4}, f(5) = 4, f(6) = 0, F({4}) = 5, F({4,5,6}) = 9, and f(0) = {4}. An 


arrow diagram for this function appears below. 
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The set X is interesting because some of the members of X (namely {4} and {4,5,6} and 
Ø) are also subsets of X. Let us compute some direct images, function values, and preimages 
for f. First, f[{5,6}] = {4,0} since the arrows leading away from the inputs 5 and 6 lead to 
4 and Í. Similarly, f[{4, {4}, Ø} = {{4}, 5}. Computing some function values now, note that 
f(5) = 4, f(4) = {4}, F({4}) =5, £({4, 5, 6}) = 0, and f (0) = {4}. In contrast, if we switch 
to square brackets, we compute the direct images f[{4}] = {{4}}, f[{4,5, 6}] = {{4}, 4, 0}, 
and f [0] = 0. Turning to preimages, we find that f—'[{4, 6}] = {5}, f-*[{4, 5, 6}] = {5, {4}}, 
f— (44,5, 6}}] = 0, f-2[0] =0, and f-1[{0}] = {6, {4,5, 6}}. Finally, f-}[5] is not defined 
since 5 is not a subset of X; on the other hand, f~![{5}] = {{4}}. 


5.75. Example. Define f : [—27,47] — R by letting f(x) = cosa for all x € [—27, 4r]. 
The graph of f is shown below. 


‘x 

-2L 
We have, for example, f[[0, 7/2]] = [0,1], f[Rso] = [-1, 1], f-*[{1}] = {—27, 0, 27, 47}, 
f-"[Reo] = (—81/2,—m/2) U (1/2, 3m/2) U (50/2, 71/2), FUR] = H-1] = 
{—r,7,37}, and f~*{(1,2]] = Ø. Looking at some restrictions of f, we have 


(fiom) R<o] = (7/2, 7], (flrs) 7 [[L, 00)] = {27, 47}, and (f|z)~*[Z] = {0}. 


Properties of Direct Images and Preimages 


The Theorem on Images (for relations) specializes to give theorems on direct images and 
preimages of sets under functions. As we see below, the preimage construction has some 
especially nice additional properties. 


5.76. Theorem on Direct Images. Let f : X — Y be a fixed function, let I be a fixed 
nonempty index set, and let A, C, A; (for each i € I) be subsets of X. Then: 

(a) Empty Image: flO] = 9. 

(b) Codomain Property: f[A] CY. 
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(c) Monotonicity: If A C C, then f[A] C f[C]. 

(d) Image of Union: f[AU C] = f[A] U f[C] and f[U;-7 Ai] = Ujez Fil. 

(e) Image of Intersection: f[ANC] € f[A]N f[C] and f[) <7 Ail E Nier fA. 
Equality is not always true. 

(£) Image of Set Difference: f|A] — f[C] C f[A — C]. Equality is not always true. 

(g) Image Under a Composition: For all g : Y > Z, (go f)[A] = g[f[A]]. 


Proof. This theorem follows immediately from the corresponding theorem for relations, and 
the fact that f[A] = G[A] where G is the graph of f. To illustrate the definition of direct 
image using function notation, we prove part (f). Fix f : X — Y and subsets A,C C X. 
Fix an object y € f[A] — f[C]; prove y € f[A—C]. We know y € f[A] and y ¢ f[C]. On one 
hand, y € f[A] means there is an x € A with y = f(z). This x cannot belong to C, since 
otherwise y € f[C], which is not true. So x € A — C. Since y = f(x) with x € A — C, we 
have y € f[A — C]. 

To see that equality is not always true in (f), we give an example. Define f : {1,2} > 
{3,4} by f(1) = 3 = f(2). Take A = {1} and C = {2}. Then f[A] = f[C] = {3}, so 
f[A] — f[C] = Ø. On the other hand, A—C = {1}, so f[A —C] = {3}. We see that Ø C {3}, 
but equality does not hold. 


Now we consider preimages. Some parts of the next theorem follow by applying the 
Theorem on Images to the relation G~! (the inverse of the graph of f), but here some key 
additional properties turn out to be true. 


5.77. Theorem on Preimages. Let f : X — Y be a fixed function, let I be a fixed 
nonempty index set, and let B, D, B; (for each i € I) be subsets of Y. Then: 

(a) Empty Preimage: f~1[0] = 0 

(b) Domain Property: f~'[B] € X. 

(c) Preimage of Codomain: f~'[Y] = X. 
(d) Monotonicity: If B C D, then f~'[B] € f-+[D]. 


(e) Preimage of Union: f-'[B U D] = f-1[B]U f-1[D] and f~t 


Le 
(B: 


tel 


= ie 


tel 


a (ae as 


tel 


(£) Preimage of Intersection: f~![BND] = f-'[B]nf-![D] and f7! 


g) Preimage of Set Difference: f~'[B| — f~'[D] = f~'[B — D]. 

h) Image of Preimage: f|f~+{[B]] C B. Equality is not always true. 

Preimage of Image: For all AC X, AC f7'[f [Al]. Equality is not always true. 
Preimage Under a Composition: For all h : W —> X, (f o h)~*[B] =h71[f-1[B]]. 


( 
( 


( 
( 


j 
We prove some parts of this theorem to illustrate the definition of preimages in terms 
of function notation. 


5.78. Proof of Preimage Property for Intersections. With the notation of the theo- 
rem, let us prove that f~t [Nier Bi] = Nicer f~'[Bi] via a chain proof. The sets appearing 
on the left and right sides of this identity are both subsets of the domain X. So, to prove 
they are equal, we can fix an arbitrary object x in X and show «x is in the left set iff 
x is in the right set. For fixed x € X, note x € f'[N;er Bi] iff f(x) € Mier Bi iff for 
alli € I, f(x) € B; iff for all i € I, x € f'[Bi] if « € Mier f [B:]. The property 
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f-'[Bn D] = f-'[B] n f71[D] is a special case of this result (let the index set I have two 
elements). 


5.79. Proof of Image of Preimage Property. Fix B C Y; we must prove f[f~![B]] € B. 
Fix y; assume y € f[f~'[B]]; prove y € B. By definition of direct image, there exists 
x € f-1[B] with y = f(x). By definition of preimage, x € X and f(x) € B. Since y = f(z), 
we conclude that y € B. 

Here is an example to show that f[{f~'[B]] = B is not always true. Define f : {1,2,3} > 
{4,5,6} by letting f(1) = f(2) = f(3) = 5. Choose B = {4,5}. Then f~'[B] = {1,2,3}, 
so f[f—*[B]] = {5}. Note that {5} is a subset of B, as promised by the theorem, but this 
subset is unequal to B. 


5.80. Proof of Preimage of Image Property. Fix A C X; we must prove A C f~![f/A]]. 
Fix x; assume x € A; prove x € f~'[f[A]]. We must prove x € X and f(x) € fA]. Since 
xz € A and A C X, we see that x € X is true. To prove f(x) € f[A], we must prove 
da € A, f(x) = f(a). Choose a = x, which is in A by assumption. Then f(x) = f(a) 
certainly holds. 

Here is an example to show that A = f~'[f[A]] is not always true. Define f : {1,2,3} > 
{4,5,6} by letting f(1) f(2) f(3) 5. Choose A = {3}. Then f[A] = {5}, so 
f-*[F[A]] = (1, 2,3}, which contains A but is unequal to A. 


5.81. Example. Define a function f : R > R by f(x) = 2? for all x € R. The graph of f 
is shown here. 


-2t 

Let A = [1, 2], considered as a subset of the domain of f. We compute f[A] = [1,4], and 
FHI] = fH, 4]] = [-2, -1] u [1, 2]. Thus A is a proper subset of f~+[f[A]]. Next let 
B = {-3,4], considered as a subset of the codomain of f. We compute f~'[B] = [-2, 2], 


and f[f~'[B]] = f[[-2, 2]] = [0,4]. Thus f[f~1[B]] is a proper subset of B. 


Section Summary 


1. Direct Images and Preimages. Let f : X — Y be a function. For all subsets A of 
X and all objects y, y € f|A] iff 3x € A,y = f(x). For all subsets B of Y and all 
objects x, x € f—*[B] iff x € X and f(x) € B. If G is the graph of f, we have 
f[A] = G[A] and f~'[B] = G"![B]. The preimage notation f~![B] can be used 
for all functions, not just functions that have inverses. 


2. Properties of Preimages. The preimage construction preserves set concepts such as 
union, intersection, set difference, set inclusion, and the empty set. For instance, 
the preimage of an arbitrary union of sets is the union of the preimages of these 
sets. 
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3. Properties of Direct Images. The direct image construction preserves set inclu- 
sions, arbitrary unions, and the empty set. But the set inclusions f[A N C] C 
FLA] FIC], FLA] — FIC] E FIA- C), FII] E B, and A c f~"[f[A]] are not 
equalities in general. 

Exercises 
1. Let X = {1,2,3,4,5,6}, and let f : X > X have graph {(1, 4), (2,3), (3,4), (4, 1), 
5,6), (6,4)}. Draw an arrow diagram for f. Use this to compute 
FU2,3,4}], fH, 3,4}, F254, F H2 SH, FI HA 5, HL, FHA, 5, GH) 
fo PHL 3,5}] and (fo F) 7'I{I, 3,5). 

2. For each f : R —> R, sketch the graph of f. Use this to compute the direct image 

fI(—1,1)], the preimage f~1[(—1,1)], and f~1[{—4}}. 
a) f(a) = 3a +5. 

b) f(x) = 42°. 

c) f(x) = |a| for x £0, f(0) = —4. 

d) f = xr-z (a characteristic function). 

e) f(x) = sinz. 

f) f(x) = ln |z| for x 4 0, f(0) = 0. 

3. For each real-valued function below, describe the preimage sets f~'{{c}] for each 
c € R (these sets are called level sets of the function). Sketch the level sets for 
c= —1,0,1,4 in each case. 

(a) f : R? 3 R given by f(x,y) = y. 

(b) f : R? > R given by f(x,y) =£ +y. 

(c) f : R? > R given by f(x,y) = 4. 

(d) f : R? > R given by f(x,y) = z? + y2. 

(e) f : R? > R given by f(x,y) = yz? + y2. 

(£) f : R? > R given by f(x,y) = y1 + zr? + y2. 

4. Let X = {1,2,3}UP({1,2}). Define f : X > X by letting f(1) = {1}, f(2) = Ø, 
f(3) = 2, and f(A) = {1,2} — A for all A C {1,2}. 

(a) Draw an arrow diagram for f. 

(b) Compute (if defined) f(1), F({1}), FADH, F, FHL, and FUH: 

(c) Compute (if defined) #({1,2}), FUL DH, F, 2J), FUL 2), FL (2H, 
FUD, 2), and FLY, 2H. 

(d) Compute FH}, F RÐ, FO FRO, FHL], and 
F 2H. 

5. Use the definition of direct image based on function notation to prove parts (c) 
and (e) of the Theorem on Direct Images. 

6. (a) Prove part (g) of the Theorem on Direct Images. 

(b) Prove part (j) of the Theorem on Preimages. 
7. Given sets A C X, let xa : X —> R be the characteristic function for A. 
(a) Find all possible direct images 4[B], as B ranges over subsets of X. Which 
choices of B produce each possible answer? 
(b) Find all possible preimages x4 [C], as C ranges over subsets of R. Which 
choices of C produce each possible answer? 
8. Suppose f : X + Y and $C X. Prove: for all B CY, f|5'[B] = f-1[B]N S. 
9. Suppose f:X —~Y,ACX,and BCY. 


(a) Prove: BN f[A] = f[f7![B]n A]. 
(b) Prove: f[f-![B] U A] C BU f [A]. 
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10. 


11. 


12: 
13. 
14. 


15. 


16. 


(c) Prove: f-1[B]N AC fTHIB A FIAJ. 
(d) Prove: f-'[B] UAC fTH[B U fA]. 


Give examples to show that the set inclusions in parts (b), (c), and (d) of the 


previous exercise are 


not always equalities. 


259 


(a) Disprove: for all relations R and all sets A, A C R7![R[A]]. 
(b) Prove or disprove: for all relations R and all sets B, R|R-t+[B]] € B. 


Prove: for all functions f : X + Y and all A C X, f[f~*[f[A]]] = fA]. 
Prove: for all functions f : X > Y and all B C Y, f-"[f[f7+[B]]] = f-+[B). 


Do the statements in the previous two exercises remain true if we only assume 
that f is a relation from X to Y? 


flA] - FIC] = f[A - 


f-*[f[A]] = A for all 


a) Find (with proof 
f[ANC] = f[A] A f[C] for all A,C C X. 
b) Find (with proof) a property of a function f 


a) Find (with proof 


b) Find (with proof 


) a property of a function f 


C] for all A,C C X. 


) a property of a function f 
ACX. 
) a property of a function f 


fIf-'[B]] = B for all B C Y. 


: X — Y that guarantees 


: X — Y that guarantees 


: X — Y that guarantees 


: X — Y that guarantees 
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5.8 Injective, Surjective, and Bijective Functions 


This section studies three special kinds of functions, called injections, surjections, and bi- 
jections. Intuitively, a function is injective (one-to-one) iff distinct inputs to the function 
always map to distinct outputs. A function is surjective (onto) iff every potential output 
in the codomain of the function is actually attained as the value of the function at some 
input. A function is bijective iff it is both one-to-one and onto. 


Definition of Injections, Surjections, and Bijections 


We introduce the concepts of injective, surjective, and bijective functions using arrow dia- 
grams. Informally, a function f : X — Y is injective (or one-to-one) iff every point in Y 
has at most one arrow entering it. This means that every potential output occurs as the 
value of the function for either zero or one inputs. The function f is surjective (or onto) iff 
every point in Y has at least one arrow entering it. This means that every potential output 
in the codomain really does get used as an actual output assigned to one of the inputs in 
the domain. Finally, the function f is bijective (or a one-to-one correspondence between X 
and Y) iff every point in Y has exactly one arrow entering it. This means that for every 
output in Y, there exists exactly one input in X mapping to that output. Compare this to 
the requirement in the definition of a function that every input in X has exactly one arrow 
leaving it. The relation between these conditions is explained in the next section when we 
discuss inverse functions. 


5.82. Example. Consider the following four arrow diagrams. 


x f Y x 8 Y 
eae 
i | 

x h Y 


y 
\ 


First of all, each of the arrow diagrams does represent a function, since each input 
x in the domain X has exactly one arrow leaving it, and all arrows go from X to Y. 
Second, f and h are injective functions, because every output in Y has at most one arrow 
coming in. The function g is not injective because the output c has two arrows coming in: 
g(2) = c = g(3). Similarly, k is not one-to-one because k(2) = b = k(3). Third, f and g 
are surjective functions, because every output in Y has at least one arrow coming in. The 
function h is not surjective because c is not hit by an arrow; similarly, k is not onto. Fourth, 
f is a bijection because each output in Y has exactly one arrow coming in. The other three 
functions are not bijective. 
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Next we give the formal definitions of injections, surjections, and bijections. At this time, 
it might be helpful to review §1.6 to recall how the phrases “at most one” and “exactly 
one” can be encoded using logical symbols. 


5.83. Definition: Injective (One-to-One) Function. Given a function f: X >Y, 


f is injective | if | Vw € X, Yz € X, f(w) = f(z) > w=). 


When this holds, we also say | f is one-to-one | or | f is an injection |. 


The definition of injective function does not explicitly mention members of Y, but these 
outputs appear implicitly in the expression f(w) = f(x). The implication in the definition 
says that if two inputs w and x ever map to the same output under f, then the two inputs 
must in fact be the same. The contrapositive of the definition may also have more intuitive 
appeal, though it may be more difficult to use in proofs: f is one-to-one iff for all w,x € X, 


w + x implies f(w) # f(x). 
5.84. Definition: Surjective (Onto) Function. Given a function f: X >Y, 


f is surjective | iff | Vy € Y, Jx € X,y = f(x) |. 


When this holds, we also say | f is onto | or | f is a surjection |. 


5.85. Definition: Bijective Function. Given a function f: X >Y, 


f is bijective | iff | f is injective and f is surjective |. 


Equivalently, | f is a bijection | if | Vy € Y,3!x € X, y = f(x) |. 


5.86. Example: Denials of the Definitions. What does it mean to say that a function 
f:X >Y is: (a) not injective; (b) not surjective; (c) not bijective? 
Solution. Using the denial rules, we find: 


(a) | f is not injective | if | 3w € X, 3x € X, f(w) = f(r) Aw A z|. 


(b) | f is not surjective | iff 


y € Y,Vx € X,y # f(z) |. 


(c) | f is not bijective | iff | f is not injective or f is not surjective |. 


Examples of Injections, Surjections, and Bijections 


5.87. Example: Exponential Function. Define f : R > R by f(x) = e for all xz € R. 
The graph of f is shown here. 


-1.5 -1.0 -0.5 0.5 1.0 1.5 
-141 


-2L 
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Each vertical line £z = xo (for xo € R) intersects the graph of f in exactly one point; 
for this reason, f is a function. On the other hand, each horizontal line y = yo (for yo € R) 
intersects the graph of f in at most one point. This means that every output yo has the form 
f(xo) for at most one input zo. So, f is injective. (Testing injectivity in this way is called 
the Horizontal Line Test.) But, for certain choices of yo € R, the horizontal line y = yo 
does not intersect the graph of f anywhere. For instance, this holds for yo = 0, yo = —1, or 
any negative value of yo. This means that f is not surjective, and hence not bijective. On 
the other hand, consider the function fı : R > Ryo, given by fı(x) = e” for all x € R. The 
function fı is not the same as f, because the two functions have different codomains. We 
see that fı is injective, surjective, and bijective, since every yo in the new codomain Ryo 
has the form yo = f (xo) for exactly one input zo € R, namely zo = Inyo. 


5.88. Example: Functions from Calculus. Define relations F, G, H, K, by setting 
F = {(x,x£3) : x € R}, G={(z,27): x € R}, H = {(z,sinz) : z € R}, and 


K = {(a,y): x € R, tan z is defined, and y = tan z}. 
The graphs of these relations are shown here: 


y 


We cannot apply the adjectives injective, surjective, or bijective to any of the graphs F, 
G, H, or K, because these graphs are relations, not functions. Each graph can be used to 
produce many different functions, depending on which domain and codomain are chosen. 
Consider these examples. 


(a) fı : R > R with graph F is injective and surjective and bijective. 


(b) fo: R > R with graph G is not injective and not surjective and not bijective. 


(c) fs : R + Rso with graph G is surjective but not injective. 


(d) f4 : R>o > Rso with graph G is not a function, because the graph G is not a subset of 


(e) fs : R>o + Rso with graph GN (Rso x Rso) is injective and surjective and bijective. 
In this case, the domain is the positive x-axis, the codomain is the positive y-axis, and the 
graph of fs is the part of the parabola G in the first quadrant. 
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(£) fe : R > R with graph H is not injective (e.g., because fg(7) = 0 = fe(—7). fe is not 
surjective (e.g., because there does not exist xo € R with fe(xzo) = 2). fe is not bijective 
(e.g., because it is not injective). 

(g) fz : R > [-1,1] with graph H is surjective but not injective. Note that we obtain f7 
from fg by restricting the codomain. 

(h) fg : [-7/2, 7/2] > R with graph H A ([-7/2, 2/2] x R) is injective but not surjective. 
Note fs = feé|{—x/2,7/2] is a restriction of fe. 

(i) fo : [>7r/2, 7/2] > [-1,1] with graph H N ([—7/2, 2/2] x R) is injective and surjective 
and bijective. Note that fo is obtained from fg by restricting both the domain and the 
codomain; these modifications have converted a non-bijection into a bijection. 

(j) fio : [0,7/2] > [0,1] with graph H N ([0, 7/2] x R) is also injective and surjective and 
bijective. This is a different restriction of fg that produces a bijection. 

(k) fi: : R > R with graph K is not a function, because there exist inputs £o in the domain 
R with no associated outputs. For example, for all y, (7/2, y) Z K, so the input 7/2 has no 
corresponding output. 


(1) To fix the previous example, let A = {17/2 + kr : k € Z} be the set of real numbers x 
where tanx is undefined. Then fi2 : R— A — R with graph K is a function, and fj, is 
surjective but not injective. 

(m) fig : (-2/2,7/2) > R with graph K N ((—7/2,2/2) x R) is a function that is in- 
jective, surjective, and bijective. The same is true of fi4 : (7/2,37/2) > R with graph 
KA ((a/2, 37/2) x R). 


These examples show that a non-surjective function f : X — Y can be changed into a 
new, surjective function by shrinking the codomain from Y to the image f[X]. Similarly, 
we can change a non-injective function g : X — Y into a new function that is injective if 
we are willing to shrink the domain X sufficiently. We might need to shrink the domain 
by a lot, however. For instance, let g : R > R be the constant function given by g(x) = 3 
for all x € R. In order to manufacture an injective function from g, we need to shrink the 
domain all the way down to a single point! The new function is not surjective either; to 
obtain surjectivity, we must also shrink the codomain to the set {3}. The resulting function 
(mapping a one-point set to a one-point set) is not very interesting, but it is a bijection. 


Proofs of Injectivity, Surjectivity, and Bijectivity 


Here we consider some examples to see how to prove that a function is or is not injective, 
surjective, or bijective. 


5.89. Example. For any set X, the identity function Idx : X — X is a bijection. To prove 
injectivity, fix w,x« € X; assume Idx(w) = Idx(x); prove w = x. Using the definition of 
Idx, the assumption becomes w = a, as needed. To prove surjectivity, fix y E€ X; we must 
prove Jx € X, Idx (ax) = y. Choose x = y; then x € X and Idx (a) = x = y, as needed. 


5.90. Example. Define f : (R — Z) > Z by f(x) = |x], which is x rounded down to 
the nearest integer. Let us show that f is surjective but not injective. For surjectivity, fix 
yo € Z. Choose zo = yo + 1/2, which does lie in the domain R — Z of f. Rounding zo down 
to the nearest integer produces yo, so f(xo) = yo. Next, to prove f is not injective, choose 
x = 1/3 and w = 2/3, which lie in R — Z. Note x # w, but f(x) =0 = f(w). 


5.91. Example. Define g : Z>9 > Zo by g(n) = 6”. Let us show that g is injective but not 
surjective. For injectivity, fix n,m € Z>ọ and assume g(n) = g(m). We must prove n = m. 
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The assumption means 6” = 6”, which is equivalent to 23” = 2™3™., By uniqueness of 
prime factorizations, we conclude that n = m. So g is injective. To show that g is not 
surjective, choose yo = 5. We must prove g(x) # 5 for all x € Z>o. Consider two cases. If 
x = 0, then g(x) = 15. If x > 0, then g(x) = 6” > 6t = 6 > 5, so again g(x) #5. So g is 
not surjective. (Alternatively, we can see that g(x) 4 5 by noting that g(x) = 2737 = 5 = 5! 
would violate the uniqueness of the prime factorization of 5.) 


5.92. Example. Define h : R — {3} > R — {2} by h(x) = #4 for all x 4 3. Let us prove 
that h is a bijection. First we show h is injective. Fix x, z € RE {3}; assume h(x) = h(z); 
prove x = z. Our assumption means 22t1 = 22+}. Using algebra to rewrite these fractions, 
we deduce 


24 7 =24 i ; 
z—3 z—3 
Subtracting 2 from both sides gives ty = i, hy = z. Multiplying 
by the nonzero numbers z — 3 and x — 3 yields z — 3 = x — 3. Finally, adding 3 to both sides 
gives z = x, so x = z is indeed true. 


Next, we show that h is surjective. Fix yo € R — {2}; we must find x € R — {3} such 


that h(x) = yo, or equivalently 2 + Ge = yo. To see eee x to choose, we do a scratch 
calculation in “ae we solve for x in terms of yo. We get -55 = yo — 2, then a = =) 
then x — oe = me zz; then x = 3 Er . Returning to the | official proof, we now choose 
zr = 3+ za . Observe that this anes is defined, since yo # 2. Furthermore, Wo is 


nonzero, Ly so x is unequal to 3. This proves that x does belong to the domain R — {3} of 
h. Finally, we conclude the proof by computing 


2(3 + 7/(yo — 2)) +1 _ 7+14/(yo — 2) _ (Zima) (a) 2 
(3+ 7/(yo — 2)) -3 7/(yo — 2) Yo —2 7 Yo- 


h(x) = 


Properties of Injections, Surjections, and Bijections 


We conclude this section with a fundamental theorem showing how composition of functions 
is related to the properties of being injective, surjective, or bijective. 


5.93. Theorem on Injections, Surjections, and Bijections. 

For all functions f : X —> Y and g: Y > Z: 

(a) If f is injective and g is injective, then go f : X — Z is injective. 

(b) If f is surjective and g is surjective, then go f : X —> Z is surjective. 

(c) If f is Pak and g is bijective, then go f : X — Z is bijective. 

(d) If go f is injective, then f is injective (but g may or may not be injective). 
(e) If go f is surjective, then g is surjective (but f may or may not be surjective). 


We prove parts (a), (b), and (e) here, asking you to prove (c) and (d) in the exercises. 
For all proofs, fix arbitrary functions f : X —> Y and g : Y —> Z; we know go f: X > Z is 
also a function, and (go f)(x) = g(f(x)) for alla € X. 


5.94. Proof of (a). Assume f is one-to-one and g is one-to-one; prove go f is one-to-one. 
We have assumed 
Vzı € X,Vx2 E€ X, f(x) = f (x2) > T1 = T2 (5.3) 


and 
Vy1 € Y, Yy € Y, gy1) = g(y2) > yı = y2. (5.4) 


Keeping in mind that go f has domain X, we must prove 


Vu, € X, Vw2 EX, (go f)(wi) = (go f) (we) > W1 = Wo. (5.5) 
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Fix arbitrary wı € X and w2 € X. Assume (g o f)(wi) = (g o f)(w2). Prove wy = we. 
We have assumed g(f(w1)) = g(f(w2)). We intend to use (5.4) and the Inference Rule for 
ALL, taking yı = f(wi) and y2 = f(w2). These choices are legitimate, because f(wi) € Y 
and f(w2) € Y (recall f maps X into Y). Now our assumption g(f(w1)) = g(f(w2)) 
becomes g(y1) = g(y2). By the Inference Rule for IF and (5.4), y1 = ye follows. This means 
f(wi) = f(w2). Now we use the rules for ALL and IF again, taking xı = wı € X and 
z2 = we € X in (5.3). We deduce wı = w2, which is the goal. 

Here is a less formal, but potentially more intuitive, contrapositive proof of the same 
result. Start with two arbitrary unequal inputs wı Æ w2 in X. Since f is injective, yı = f(wi) 
and y2 = f(w2) are unequal intermediate outputs in Y. Since g is injective, g(f(wi)) and 
g(f(we)) are unequal final outputs in Z. So, wi # w2 implies (g o f)(wi) 4 (go f)(wa), 
which means that go f is one-to-one. 


5.95. Proof of (b). Assume f : X > Y is onto and g : Y > Z is onto; prove go f : X > Z 
is onto. We have assumed Vy € Y, Jx € X,y = f(x) and Vz € Z, dy € Y,z = g(y). We must 
prove Vz € Z, dx € X,z = (go f)(x). To do so, fix an arbitrary zo € Z. By our assumption 
on g, there is a fixed object yo € Y with zp = g(yo). Now by our assumption on f, there is 
a fixed object xo € X with yo = f(x). We see that zo = glyo) = g(f(%o)) = (g o f) (a0). 
Thus to prove dx € X, zo = (go f)(x), we may choose x = xo. 


5.96. Proof of (e). Assume go f is surjective. Prove g is surjective. We have assumed 
Vz € Z,Jxz € X,z = (go f)(x). We must prove Vz € Z,Jy € Y,z = g(y). Fix arbitrary 
zo € Z. Prove dy € Y,z = g(y). Using our assumption, we know there is some object 
zo E X with zo = (go f)(xo). This means zo = g(f(x%o)). Comparing to the goal, we may 
choose yo = f (ao) to make zo = g(yo) be true. Observe that yo does lie in the required set 
Y, since f is a function with codomain Y. 

To justify the parenthetical remark in (e), we give an example where go f is surjective 
but f is not surjective. Let X = {1,2}, Y = {1,2,3}, and Z = {4,5}. Define f: X > Y 
by f(1) = 1 and f(2) = 2. Define g : Y > Z by g(1) = 4 and g(2) = 5 and g(3) = 5. 
Now go f : X > Z satisfies (g o f)(1) = 4 and (go f)(2) = 5. It is immediate that go f 
is surjective (in fact, bijective), yet f is not surjective since 3 € Y does not have the form 
f(x) for any x € X. 


Section Summary 


1. Injective Functions. A function f : X — Y is injective (one-to-one) iff for all 
w,x € X, if f(w) = f(x), then w = zx. In an arrow diagram for an injective 
function, every output gets hit by at most one arrow. A real-valued function is 
injective iff every horizontal line y = yo intersects the graph in at most one point. 


2. Surjective Functions. A function f : X — Y is surjective (onto) iff 
Vy € Y, Jx € X,y = f(x). In an arrow diagram for a surjective function, every 
output gets hit by at least one arrow. A real-valued function is surjective iff every 
horizontal line y = yo intersects the graph in at least one point. 


3. Bijective Functions. A function f : X — Y is bijective iff f is injective and 
surjective iff Vy € Y,3!x € X,y = f(x). In an arrow diagram for a bijective 
function, every output gets hit by exactly one arrow. A real-valued function is 
bijective iff every horizontal line y = yo intersects the graph in exactly one point. 


4. Properties of Injections, Surjections, and Bijections. The composition of two in- 
jective functions is injective. The composition of two surjective functions is sur- 
jective. The composition of two bijective functions is bijective. If go f is injective, 
then f is injective. If g o f is surjective, then g is surjective. 
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Exercises 


Í; 


10. 


11. 


Let A = {1,2,3,4}, B = {v,w, x,y,z}, and consider the following four functions: 
f: A— B with graph {(1, w), (2, z), (3, y), (4, w)}; 

g : B —> A with graph {(v, 4), (w, 3), (y, 1), (x, 2), (z, 4)}; 

h : A A with graph {(1, 4), (2, 4), (3, 1), (4,3)}; 

k: B > B with graph {(v, z), (z, y), (y, v), (w, x), (x, w)}. 

Decide whether each function is injective, surjective, or bijective. 


For each of the following functions f : R > R, is f injective? surjective? bijective? 
(a) f(z) = cosg. (b) f(x) = e™7. (c) f(z) = x4. (d) f(x) = Yx. (e) f(z) = 1/x 
for x £0, and f(0) = 0. (£) f(x) = [x] (x rounded up to the nearest integer). 
Let a # 0 and b be real constants. Prove that f : R > R, given by f(x) = ax +b 
for x € R, is a bijection. 


Define g : Rzo + Ryo by g(x) = 1/x for nonzero real x. Prove that g is a 
bijection. 

Define f : R > R by f(x) = cosa and define g : R > |-1,1] by g(x) =c 
Is each of the following functions injective? surjective? bijective? (a) f (b) g 
(c) F | [0,7] (d) 9\{0,7] (e) 9\{0,27] (£) gli3z/2,57/2] (g) fof (h) go (fl [o,1))- 

Give an example of a function f : Z>9 > Zso with the stated properties. Prove 
that your examples work. (a) f is injective but not surjective. (b) f is surjective 
but not injective. (c) f is not injective, not surjective, and not constant. (d) f is 
injective and surjective, but f is not the identity function. 


Determine whether each of the following functions is: (i) injective; (ii) surjective; 
iii) bijective. Prove your answers. 

a) f : R — R defined by f(x) = x? — zx. 

b) g : R > [1,00) defined by g(x) =e” 

c) h : R — {3/7} > R — {1/7} defined by h(x) = #45. 

d) k : Rzo > R defined by k(x) = ln |z]. 

e) p : (0,00) > (0,00) defined by p(x) = 2”. 


Determine whether each of the following functions is: (i) injective; (ii) surjective; 
iii) bijective. Explain informally. 

a) p : Z>0 > Z>o defined by p(n) = n? for n € Z>o. 

b) q: Z > Z defined by q(n) = n? for n € Z. 

c) r: R —> R defined by r(x) = x? for x € R. 

d) s : Z > Z defined by s(m) = m? for m € Z. 

e) t : R > R defined by t(x) = x? for x € R. 

f) u : C+ C defined by u(z) = 2? for z € C. 


Let f : X > Y be a function with graph F. Prove carefully that f is injective iff 
for all x,y,z, (x,y) € F and (z,y) € F implies z = z. 


(a) Is the function (Ø, Ø, Ø) injective? surjective? bijective? (b) Repeat part (a) for 
the function (Ø, B, Ø), where B is a nonempty set. 


For each part, find functions f,g : R > R satisfying the stated conditions. 


Relations and Functions 267 


12. Prove parts (c) and (d) of Theorem 5.93. 

13. Prove: for all f : X + Y, f is injective iff for all A,C C X, f[ANC] = f[A]A fIC]. 
14. Prove: for all f : X > Y, f is injective iff for all A,C C X, f[A—C] = f[A]— f[C]. 
15. Prove: for all f : X 3 Y, f is injective iff for all A C X, f-+[f[A]] = A. 

16. Prove: for all f : X — Y, f is surjective iff for all B CY, f[f~+[B]] = B. 


17. Suppose h: X —> X is a bijection. (a) Prove: for all functions f : X > Y, f is 
injective iff f oh is injective; f is surjective iff f o h is surjective; and f is bijective 
iff foh is bijective. (b) Prove: for all functions g : W > X, g is injective iff hog is 
injective; g is surjective iff hog is surjective; and g is bijective iff hog is bijective. 
(c) Which implications in (a) and (b) remain true if h is only assumed to be an 
injection? (d) Which implications in (a) and (b) remain true if h is only assumed 
to be a surjection? 


18. (a) Use uniqueness of prime factorization to show that g : Z>o x Z>0 > Z>0 
defined by g(a, b) = 23° is injective. (b) Generalize this idea to define an injection 
hy : Zo > Z>o for any fixed integer k > 1. (c) Define f : Z>o x Z>0 > Z>o 
by f(a,b) = 2°(2b + 1) — 1. Prove that f is a bijection. (d) Use the function f in 
part (c) to recursively construct bijections pp : ZX, > Z>o for all k > 2. 


19. Suppose f : X > Y is a function and $C X. 
(a) Prove: if f is injective, then f|s is injective. 
(b) Prove: if f|s is surjective, then f is surjective. 
(c) Give an example to show that the converse of (a) can be false. 
(d) Give an example to show that the converse of (b) can be false. 


art 


20. Consider a function given by the formula f(a) = erth for some real constants 
a,b,c,d such that ad—bc Æ 0. (a) Find the largest possible domain of f contained 
in R, and the smallest possible codomain corresponding to this domain. (Consider 
three cases: (i) a = 0; (ii) c = 0; (iii) a 4 0 # c.) (b) Prove that the function 
f with domain and codomain found in part (a) is a bijection. (c) Where in your 
proofs did you use the assumption ad — bc # 0? What happens if ad — bc = 0? 


21. Suppose f : A —> C and g : B > C are gluable functions. Let h = glue(f,g) : 
AU B > C. (a) Prove: If f or g is surjective, then h is surjective. (b) Prove: If 
h is injective, then f and g are injective. (c) Show that the converse of part (a) 
can be false. (d) Show that the converse of part (b) can be false. 


22. Suppose f : A> B, g : C > D, and ANC = BAD = Ẹ. Let h = glue( f, g) : 
AUC —> BUD. (a) Prove: h is injective iff f and g are injective. (b) Prove: 
h is surjective iff f and g are surjective. (c) Prove: h is bijective iff f and g are 
bijective. (d) What can be said if BN D # Ø? (e) What happens if ANC # 0? 

23. Suppose f : X — Y is a function. Prove: f is injective iff for all sets W and all 
functions g,h : W —> X, fog = f oh implies g =h. 

24. Suppose f : X —> Y is a function. Prove: f is surjective iff for all sets Z and all 
functions g,h : Y —> Z, go f = ho f implies g = h. 
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pet 
5.9 Inverse Functions 


In calculus, it is often helpful to pass from a function f to the inverse function f~!. The 
idea is that the inverse function interchanges the roles of inputs and outputs, so that for all 
inputs x and outputs y for f, y = f(x) iff x = f~+(y). A key subtlety is that many functions 
do not have inverse functions (in contrast to relations, where the inverse of a relation R 
is always another relation). It turns out that the inverse of a function f exists iff f is a 
bijection. The optional second half of this section introduces left inverses, right inverses, 
and their connections to injective and surjective functions. 


The Inverse of a Function 


The idea of interchanging inputs and outputs leads to the following definition of inverse 
functions. 


5.97. Definition: Inverse Function. Given a function f : X — Y, an inverse function 
for f is a function g : Y > X satisfying | Yx € X,Vy € Y,y = f(x) & z = gly). 


When such a function g exists, we say that f is invertible and write g = f—!. 


The next example shows that inverse functions do not always exist. But if a function 
does have an inverse, it is unique, as we show later. This justifies the introduction of the 
notation f7! for the inverse function. 


5.98. Example. Define f : {1,2,3} > {a,b,c} by f(1) = b, f(2) = c, and f(3) =a. An 
inverse function for f is the function g : {a,b,c} > {1,2,3} given by g(a) = 3, g(b) = 1, 
and g(c) = 2. On the other hand, consider h : {1,2,3} — {a,b} given by h(1) = b, 
h(2) = a, and h(3) = b. We claim h has no inverse function. To get a contradiction, 
assume k : {a,b} — {1,2,3} is an inverse for h. Since h(1) = b, we must have k(b) = 1. 
Since h(3) = b, we must also have k(b) = 3. Now k is not a function, since the input b is 
associated with two different outputs 1 and 3. Similarly, consider p : {1,2} > {a,b,c} given 
by p(1) = a and p(2) = c. If q : {a,b,c} — {1,2} were an inverse function for p, we must 
have q(b) = 1 or q(b) = 2. On one hand, if q(b) = 1, we deduce that p(1) = b, which is false. 
On the other hand, if g(b) = 2, we see that p(2) = b, which is false. So p has no inverse 
function. 


The next theorem shows how the concept of an inverse function is related to composition 
of functions and identity functions. The condition in this theorem is sometimes used as the 
definition of inverse functions. 


5.99. Theorem on Inverses and Composition. For all functions f : X — Y and 
g:Y — X, g is an inverse function for f iff fog = Idy and go f = Idx. 


Proof. Fix functions f : X —> Y and g : Y > X. Part 1. Assume g is an inverse function 
for f; prove fog = Idy and go f = Idx. Both f og and Idy are functions from Y to Y. 
To prove they are equal, we fix y € Y and show (f o g)(y) = Idy (y). Write x = gly) € X. 
By definition of inverse functions, we know y = f(x). Now compute (f o g)(y) = f(g(y)) = 
f(x) =y = Idy (y). A similar calculation shows that go f = Idx. 

Part 2. Assume f og = Idy and go f = Idx; prove Yx € X,Vy € Y,y = f(r) & xr = 
g(y). Fixx € X and y € Y. Part 2a. Assume y = f(x); prove x = g(y). We compute 
x = Idx(x) = (go f)(x) = g(f(x)) = g(y), as needed. Part 2b. Assume z = g(y); prove 
y = f(x). Here, we compute y = Idy (y) = (f o g)(y) = f(g(y)) = f(a), as needed. 
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5.100. Corollary: Uniqueness of Inverse Functions. Every f : X — Y has at most 
one inverse function. 


Proof. Fix f : X > Y; assume g : Y —> X andh: Y > X are both inverse functions for f; 
prove g = h. [The direct approach is to fix y € Y and prove g(y) = h(y). But here is a faster 
way using Theorem 5.99 and the identity and associativity properties from Theorem 5.62. 
We compute g = go Idy = go (f o h) = (go f)oh=Idx oh=h. 


Properties of Inverse Functions 


We are about to show that a function f is invertible iff f is bijective. First we give an 
intuitive version of the proof based on arrow diagrams. Given a function f : X —> Y, we 
know each x € X has exactly one arrow leaving it. We attempt to form the inverse function 
g: Y — X by reversing all the arrows. In order for g to be a function, it must be true that 
each y € Y has exactly one g-arrow leaving it. Since g-arrows are the reversals of f-arrows, 
this means that each y € Y should have exactly one f-arrow entering it. This is precisely 
the criterion for f to be a bijection. The next proof formalizes this argument. 


5.101. Theorem on Inverses and Bijections. For all f : X — Y, f has an inverse 
function iff f is a bijection. 


Proof. Fix f : X — Y. Part 1. Assume f has an inverse function g : Y —> X. Prove 
f is a bijection, which means f is surjective and injective. By Theorem 5.99, we know 
go f = Idx and fog = Idy. Now Idx is injective, so go f is injective, so f is injective 
by Theorem 5.93(d). Similarly, Idy is surjective, so f o g is surjective, so f is surjective by 
Theorem 5.93(e). 

Part 2. Assume f is a bijection (hence surjective and injective); prove f has an inverse 
function. Let F = {(x, f(x)) : x € X} be the graph of f. Define g = (Y, X, F71), where 
F-! = {(y,x) : (x,y) € F} is the inverse of the relation F. We prove below that g really 
is a function (with domain Y and codomain X). Granting this fact, we can then conclude 
that for all x € X and y € Y, x = g(y) iff (y, x) € F~+ iff (x,y) € F iff y = f(x), so that 
g is the required inverse function for f. To prove that g is a function, we check the three 
conditions appearing after Definition 5.35. First, F is a relation from X to Y (i.e., a subset 
of X x Y), so we know F™! is a relation from Y to X (i.e., a subset of Y x X). 

Second, to prove Existence of Outputs for g, we must prove Vy € Y, Jx € X, (y, x) € Fut. 
Note (y, x) € F`! iff (x,y) € F iff y = f(x). So we must prove Vy € Y, Jx € X,y = f(a). 
This is true since our assumption guarantees f is surjective. 

Third, to prove Uniqueness of Outputs for g, we must prove for all y, x, w, if (y, x) € F7! 
and (y, w) € F71, then z = w. Fix y,x,w. Assume (y,x) € Ft and (y, w) € F~+; prove 
x = w. Our assumption can be rewritten (x,y) € F and (w,y) € F, i.e., y = f(x) and 
y = f(w). Then f(x) = f(w), so x = w because f is injective. 


5.102. Example. The function f : R > R, given by f(x) = e” for x € R, is injective but 
not surjective, so it has no inverse. By restricting the codomain, we can obtain a bijective 
function exp : R + Rso that does have an inverse. The inverse is the natural logarithm 
function In: Ryo > R. In this setting, Theorem 5.99 states that In(e”) = x for all x € R, 
and e}! = y for all y € Ryo. 

The function f : R > R given by f(x) = x? for x € R is not a bijection, so it has no 
inverse. On the other hand, fı : R>o > R>o given by f1(x) = 2? for x € Ryo is a bijection; 
the inverse function is f7 (y) = \/y for y € Ryo. The function fz : R<o > Rso given by 
fo(x) = x? for x € R<o is also a bijection; the inverse satisfies fy '(y) = —/y for y € Rso. 
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Next consider h : R —> R given by h(x) = sin x. The function h is neither injective nor 
surjective, so it has no inverse. By restricting the domain and codomain of h, we can create 
a new bijective function S : [—1/2, 7/2] > [—1,1] given by S(x) = sin z for x € [—1/2, 7/2]. 
The inverse function S~! : [—1,1] > [-7/2, 7/2] is called the arcsine function in calculus. 
5.103. Theorem on Invertible Functions. For all functions f : X + Y and g: Y > Z: 
(a) Idx : X + X is invertible, and Id5' = Idx. 

(b) If f is invertible, then f~! is invertible and (f~!)~1 = f. 

(c) If f and g are invertible, then go f is invertible and (go f)~! = f-'og™t. 

(d) If f is a bijection, then fT! exists and is a bijection. 

Proof. Fix functions f : X —> Y and g : Y — Z. We prove each part using the criterion 
in Theorem 5.99. For (a), we take Y = X and f = g = Idx in that theorem. Since 
“Idx oIdx = Idx and Idx oIdx = Idx” is true, the theorem tells us that Idx is the 
inverse function for Idx. To prove (b), assume f is invertible. We know “f~!o f = Idx and 
fof! = Idy” is true. Now use Theorem 5.99, replacing X by Y, Y by X, f by f7!, and 
g by f. We deduce that f is an inverse function for f~t, i.e., (f~!)~! = f. To prove (c), 
assume f and g are invertible. By Theorem 5.99 (replacing Y by Z, f by go f, and g by 
f-tog7), it suffices to prove that (go f)o(f~tog7!) = Idz and (f~'og™!)o(gof) = Idx. 
To prove the first equation, use associativity of function composition to compute 


(gof)o(f-tog”*) = go(fo(ftog*)) = go((fof*)og~*) = go(Idy og~*) = gog™* = Idz . 


The second equation is proved similarly. Finally, part (d) follows from part (b) and the fact 
that f is invertible iff f is bijective. 


Left Inverses (Optional) 


Theorem 5.99 tells us that g is an inverse function for f iff fog and go f are both identity 
functions. We now investigate what happens when only one of these conditions holds. 


5.104. Definition: Left Inverses. For all functions f : X > Y and g : Y > X, g is a left 
inverse for f iff go f = Idx. 


5.105. Example. Figure 5.10 gives arrow diagrams for two functions f and h. A left inverse 
for f is a function g : {a,b,c,d,e} > {1,2,3} satisfying (g o f)(x) = Idx(x) = x for all 
x € {1,2,3}. Taking x = 1,2,3, we see that g must satisfy g(d) = 1, g(b) = 2, g(a) = 3. 
However, no requirement has been imposed on g(c) and g(e), since c and e are not in the 
image of f. We obtain one left inverse for f by setting g(c) = 1 and g(e) = 1. We could 
build other left inverses for f by choosing other values for g(c) and g(e). In this example, f 
has nine different left inverses. So, unlike the two-sided inverses studied earlier, left inverses 
need not be unique. Also, no left inverse g can satisfy f o g = Idy, since f is not a bijection. 
For instance, the g chosen above has (f o g)(c) = f(g(c)) = f1) = d # c = Idy (6). 

Furthermore, left inverses do not always exist. Suppose we try to find a left inverse 
g : {a,b,c} — {1,2,3, 4,5,6} for the function h in the figure. Writing out the requirement 
go h = Idx, we obtain (among other conditions) g(h(2)) = 2 and g(h(5)) = 5. Now 
h(2) = c = h(5), so we must have g(c) = 2 and g(c) = 5. But then g is not a function, since 
input c has two associated outputs. This shows that h does not have any left inverses. The 
trouble arises because h was not injective: distinct inputs 2 and 5 to h both mapped to the 
same output c. 


The previous example illustrates the following characterization of which functions have 
left inverses. 
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FIGURE 5.10 
Arrow diagrams of two functions. 


5.106. Theorem on Left Inverses. Let X be a nonempty set. For all functions 
f[:X-Y, f has a left inverse iff f is injective. 


Proof. Fix f : X — Y, where X is nonempty. First assume f has a left inverse g : Y > X. 
Since go f = Idx is an injective function, Theorem 5.93(d) tells us that f is injective. 

Conversely, assume f is injective; we must construct a left inverse g : Y —> X. To do so, 
let Z = f[X] C Y by the image of f, and define fı : X > Z by fi(x) = f(x) for alla € X. 
The function fı (obtained by restricting the codomain of f) is injective and surjective, so 
it has a two-sided inverse gı : Z —> X. In particular, gı o fı = Idx. To build the function 
g : Y — X, select one fixed element xo € X; this is possible since X is nonempty. Now 
define g(y) = gi(y) for y € Z, and g(y) = xo for y € Y — Z. You can check that g really 
is a function from Y to X. For instance, this is a special case of the gluing construction 
in Definition 5.68; here we are gluing together gı and the constant function with domain 
Y — Z and constant value zo. 

To finish, we must check that go f = Idx. Fix x € X, and set y = f(x), which is in 
the image Z of f. We now compute (g o f)(«) = g(f(2)) = 9(y) = aly) = m(file)) 
(91° fı)(x) = Idx (z). 


II 


Right Inverses (Optional) 


5.107. Definition: Right Inverses. For all functions f : X > Y and g: Y —> X, gisa 
right inverse for f iff f o g = Idy. 


5.108. Example. Consider once again the functions f and h in Figure 5.10. A function 
g : {a,b,c} > {1, 2, 3,4,5,6} is a right inverse for h iff h o g = Idta b,c} iff h(g(a)) = a and 
h(g(b)) = b and h(g(c)) = c. Now, h(x) = a is true when z is 1, 4, or 6. So we could define 
g(a) to be any one of these three values and thereby satisfy h(g(a)) = a. Similarly, h(x) = b 
is solved by x = 3, and h(x) = c is true when <v is 2 or 5. Thus, g is a right inverse for 
h iff g(a) € {1,4,6} = h7*[{a}] and g(b) € {3} = h7*[{b}] and g(c) € {2,5} = h7"[{c}]. 
For example, one right inverse is given by g(a) = 1, g(b) = 3, and g(c) = 2. Another right 
inverse is given by gi(a) = 4, gı (b) = 3, and gi(c) = 5. In this example, there are 3-1-2 = 6 
possible right inverses for h. So, right inverses need not be unique. Also, no right inverse g 
can satisfy go h = Idx, since h is not a bijection. For instance, the right inverse g chosen 
above has (g o h)(4) = g(h(4)) = g(a) = 1 # 4 = Idx (4). 

Furthermore, right inverses do not always exist. Suppose we try to find a right inverse 
g : {a,b,c, d,e} — {1,2,3} for the function f in the figure. Writing out the requirement 
fog = Idy, we obtain (among other conditions) f(g(c)) = c. But there is no æ in the 
domain of f solving f(x) = c, since f~'[{c}] = 0, so it is impossible to define g(c) in a way 
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that makes f(g(c)) = c true. So f does not have any right inverses. This happened because 
f was not surjective: there exist points (such as c and e) in the codomain of f that do not 
have the form f(x) for any x in the domain of f. 


The previous example illustrates the following characterization of which functions have 
right inverses. 


5.109. Theorem on Right Inverses. For all functions f : X — Y, f has a right inverse 
iff f is surjective. 


Proof. Fix f : X —> Y. First assume f has a right inverse g: Y —> X. Since f og = Idy is 
a surjective function, Theorem 5.93(e) tells us that f is surjective. 

Conversely, assume f is surjective; we must construct a right inverse g : Y > X. 
Informally, we can build g as follows. For each y € Y, we know that the preimage f~*[{y}] 
is a nonempty subset of X, because f is surjective. Define g(y) to be any particular element 
of this nonempty set (chosen arbitrarily). To check that fog = Idy, fix y € Y, and compute 
(fog)(y) = f(g(y)) = y = Idy (y). The equality f(g(y)) = y is true because g(y) € f~"[{y}].- 
Thus the proof appears to be complete. 

[This paragraph should be considered optional.] There is a technical subtlety, however. 
To give a completely rigorous definition of g, we must construct the graph of g, which is a 
certain set of ordered pairs G = {(y, g(y)) : y € Y}. To prove that this set exists working 
within axiomatic set theory, we must use the Axiom of Choice (see §3.7). Specifically, let Z 
be the collection of all sets f~'[{y}], for y € Y. The set Z exists by the Power Set Axiom 
and the Axiom of Specification, since we can write Z = {S € P(X) : 3y € Y, S = f-'[{y}}}. 
Since f is surjective, the empty set is not a member of Z. As discussed in §3.7, the Axiom 
of Choice provides us with a set G” C Z x X such that for all (S,x) € G’, x € S, and 
for each S € Z, there exists a unique x € X with (S,x) € G’. (The set G” is the formal 
mechanism by which we choose one element x from each nonempty set S = f~'[{y}].) Now 
define G = {(y,x) €E Y x X : IS € Z, S = f !Hy} A (S, x) € G’}. It must now be checked 
that g = (Y, X, G) is a function; the key point is that for each y € Y, there exists a unique 
S € Z with S = f~t[{y}], and hence there exists a unique x € X with (y, x) € G. Finally, 
by definition of G and Z, (y, x) € G implies x € fTtH{y}, so that f(g(y)) = f(x) = y for 
each y € Y. 


Section Summary 


1. Inverse Functions. Given f : X —> Y, a function g : Y > X is an inverse function 
for f if Yx € X,Yy € Y,y = f(x) © x = g(y). Equivalently, g is an inverse 
function for f if f og = Idy and go f = Idx. Inverse functions are unique when 
they exist; the inverse of f is denoted f~?. 

2. Properties of Invertibility. A function f : X — Y is invertible iff f is a bijection. 
Identity functions are invertible. The inverse of an invertible function is invertible. 
The composition of invertible functions f : X —> Y and h : Y —> Z is invertible, 
and (ho f) t= ftoht. 

3. Left Inverses. A left inverse of f : X — Y is a function g : Y > X such that 
go f = Idy. Assuming X is nonempty, f has a left inverse iff f is injective. Left 
inverses are not unique in general. 

4. Right Inverses. A right inverse of f : X —> Y is a function g : Y > X such that 
fog= Idy. A function f has a right inverse iff f is surjective. Right inverses are 
not unique in general. 
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Exercises 


Í; 


Let A = {1,2,3,4}, B = {v,w, x,y,z}, and consider the following four functions: 


f:A—>B_ definedby f(1)= w, f(2)= z, f(3) = y, f(4 = w; 


) 
g:B—+A_ definedby g(v)= 4, g(w) = 3, gly) = 1, g(x) = 2, glz) = 4; 
h:A—+ A defined by A(1) = 4, h(2) = 4, A(3) = 1, h(4) = 3; 
k:B—-+B_ defined by k(v) =z, k(z) =y, k(y) = v, k(w) = x, k(x) = w 


Which functions have: (a) a left inverse? (b) a right inverse? (c) a two-sided 
inverse? Give an example of each type of inverse when it exists. 


For each of the following functions from calculus, say whether the function has a 
left, right, or two-sided inverse. Give an example of each type of inverse when it 
exists. 


8 
wy 
A 
4 


R>o given by f(x) = x°. 

: R > R given by f(x) = cosa. 

: R > [-1,1] given by f(x) = cos z. 

: (0,7/2) > R given by f(x) = cos z. 

: [0,7] + [-1,1] given by f(x) = cos z. 

: R > R given by f(x) = x3 — z. 

: R>o > R given by f(x) = 1/(1 +27). 

: R<o > (0, 1] given by f(x) = 1/(1 + 27). 

= {1,2,3,4,5,6} and B = {w, x,y,z}. Consider the functions: 


o` 
~ 


L SaS 


f:A—>B_ with graph {( 
g:B—-A_ with graph {( 
h:A—=>A with graph {( ; 
k:B—>B_ with graph {(w, w), (2, w), (y, w), (z,z)}. 


(a) For each function, find all possible left inverses for the function, or explain 
why none exist. (b) Repeat part (a) for right inverses. (c) Repeat part (a) for 
two-sided inverses. 


For each of the following functions, say whether the function has a left, right, or 
two-sided inverse. Give an example of each type of inverse when it exists. 

a) f: Z > Z given by f(n) = 2n. 
b) f : Z>0o > Zso given by f(n) =n+1. 
c) f: ZZ given by f(n) =n+1. 
d) f : Z > Z given by f(n) = —n. 

e) f : Z>0 > Z>o given by f(n) = n?. 

f) f :Z>0 > Z given by f(n) =n. 

g) f : Z>0 > Zso given by f(0) = 0 and f(n)=n-— 1 for n > 0. 

Define f : Z>o > {0,1,2,...,9} by letting f(n) be the last digit in the base-10 
expansion of n. For example, f(1407) = 7. Describe three different right inverses 
for f. 


(a) Finish Part 1 of the proof of Theorem 5.99 by showing that go f = Idx. 
(b) Finish the proof of part (c) of Theorem 5.103. 


Let B be a fixed set, and let f : Ø —> B be the empty function. Under what 
conditions does f have a left, right, or two-sided inverse? 


Suppose f : A — R is a function. Find necessary and sufficient conditions to 
ensure that there exists a function g : A —> R such that f -g =g- f =1. (The 
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dot denotes pointwise product of functions, and 1 denotes the constant function 
on A with value 1.) 


9. Suppose f : R — R is a function. Find necessary and sufficient conditions to 
ensure that there exists a function g : R —> R such that f -g =g- f = Idg. 


10. Suppose f : R > R is a function. (a) Under what conditions does there exist 
g : R —> R such that f o g = 1 (the constant function 1)? Describe all functions 
g that satisfy this property (when such g exist). (b) Under what conditions does 
there exist g : R > R such that go f = 1? Describe all functions g that satisfy 
this property (when such g exist). 


11. Suppose f : X — Y and g,h: Y — Z are functions. Prove: if f has a right 
inverse and go f = ho f, then g = h. Give an example to show that this right 
cancellation property can fail if f does not have a right inverse. 

12. Suppose f : X —> Y and g,h:W — X are functions. Prove: if f has a left inverse 
and fog = f oh, then g = h. Give an example to show that this left cancellation 
property can fail if f does not have a left inverse. 

13. (a) Suppose f : X > Y and g : Y — Z both possess left inverses. Show that go f 
also has a left inverse. (b) Prove a similar result for right inverses. 

14. Suppose n € Zso, Xo,...,Xn are sets, and fi : X;-1 — X; are bijections for 
1 <i <n. Use induction on n to prove that (fn o+- o f1)~! exists, and 
(fno o fao A) = fro fg +0 fy’. 

15. Prove: for all sets X, Y and all f : X > Y, g: Y — X, g is a left inverse of f iff 
(Vx € X, Yy € Y,y = f(x) > x = g(y)). 

16. Prove: for all sets X, Y and all f: X > Y, g: Y — X, g is a right inverse of f 
if (Va € X, Vy € Y, z = g(y) => y = f(z)). 

17. Use right inverses to prove: for all nonempty sets X, Y, Z, all f : X > Y, and 
all g: Y —> Z, if go f is one-to-one and f is onto, then g is one-to-one. 

18. Use left inverses to prove: for all nonempty sets X, Y, Z, all f : X — Y, and all 
g: Y — Z, if go f is surjective and g is injective, then f is surjective. 

19. Suppose f is an injection mapping an n-element set into an m-element set, where 
m > n. How many left inverses does f have? 

20. Let f: X — Y bea surjection. Suppose Y = {y1, Y2,..., Ym}, and suppose that, 
for 1 < i < m, there are exactly n; elements x € X with f(x) = y;. How many 
right inverses does f have? 

21. Let X be an n-element set. How many bijections are there from X onto X? 


22. Suppose f : X — Y is a function with graph F. Prove that the inverse relation 
F~' is the graph of a function (not necessarily a function from Y to X) iff f is 
injective. 

23. Suppose f : X —> Y is a function with graph F. Prove that the inverse relation 
F~' contains the graph of some function g : Y + X iff f is surjective. 


24. Prove: if f : X — Y has exactly one right inverse g, then f is invertible and 
g= f. 

25. If f: X — Y has exactly one left inverse, must f be invertible? What if X has 
more than one element? 


6 


Equivalence Relations and Partial Orders 


6.1 Reflexive, Symmetric, and Transitive Relations 


In this chapter, we continue our study of relations. So far, we have been thinking of a 
relation R as a generalization of a function, where (x,y) E€ R means that the input «x is 
associated with the output y under R. Now, we change our viewpoint slightly, interpreting 
(x,y) E€ R to mean that x is “related” to y in some way, without insisting on the notion 
that x is the “input” and y is the “output.” For example, (x,y) € R might mean z is equal 
to y, or x is greater than y, or x is a subset of y, or x is a sibling of y, and so on. We begin 
this section by developing a new way of visualizing R, called a digraph, that emphasizes 
this “relational” viewpoint. Later, we will be especially interested in relations that share 
certain properties with the equality relation, called reflexivity, symmetry, and transitivity. 
We define these properties here and give many examples. 


Digraph of a Relation 


Previously, we looked at relations R from X to Y, where X was an “input set” and Y was 
an “output set.” Now, we focus attention on a single set of objects X, thinking of R as 
specifying a relationship that may or may not hold between any two given objects in X. 
This leads to the following definitions. 


6.1. Definition: Relation on a Set X. For all sets X and R, |R is a relation on X | iff 
RC X x X |. For all objects x and y, we write | «Ry | if | (x,y) € R|. The notation «Ry is 
called infix notation and can be read “zx is related to y under R.” 


6.2. Example. Let X = {1,2,3}. The following sets of ordered pairs are relations on X: 
Ry = {(1, 1), (2, 2), (3,3); Rə = {(1,2), (2,3), (1,3)}; Rs = Rı U Ro; 


R, = {(1, 3), (2, 2), (3, 1)}; Rs = {(1,1), (1,2), (1,3)}; Re = {(1, 3), (2, 3)}- 


Relation Rı was formerly called the identity relation Ix on the set X. Now, we can think of 
R, as the equality relation on this set, since for all x,y € X, (x,y) € Rı iff x = y. Similarly, 
Rə is the less-than relation on X, since for all x,y € X, (x,y) € Ro iff x < y. Next, for all 
x,y E€ X, xRgy iff x < y. We see that the notation xRy for relations imitates the way we 
write arithmetical relations such as =, <, and <. 

Turning to R4, we have zR4y iff x + y = 4, for all x,y € X. As for Rs, note xRs5y iff 
x = 1, so that this “relation” between x and y does not actually involve y. Relation Rg does 
not describe any particularly meaningful relationship between x and y; it illustrates that a 
relation on X can be any set of ordered pairs contained in X x X. 


Earlier, we introduced arrow diagrams and (Cartesian) graphs as ways to visualize rela- 
tions from X to Y. When R is a relation on a finite set X, we can instead draw a modified 
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FIGURE 6.1 
Digraphs for six relations on X = {1,2,3}. 


version of the arrow diagram called the digraph (directed graph) of the relation. We make 
a single copy of the set X, drawing a dot labeled x for each object x € X. For each ordered 
pair (x,y) € R, we draw an arrow from z to y. If y = a, this arrow becomes a little loop that 
starts and ends at x. Figure 6.1 displays the digraphs for the six relations in Example 6.2. 


6.3. Example. Define a relation R on the set X = {1, 2,3, 4,5,6} by 
R= {(, 1), (2, 2), (3, 3), (4, 4), (2, 5), (5, 2), (1, 3), (3, 4), (4, 1), (2, 6)}. 


Figure 6.2 displays both the arrow diagram and the digraph for this relation. We will see 
that certain structural features of R can be seen more easily in the digraph picture. 


arrow diagram for R digraph for R 


FIGURE 6.2 
Arrow diagram and digraph for a relation R. 


Reflexive Relations 


Now we introduce three properties of abstract relations that are based on corresponding 
properties of logical equality. The first property, called reflexivity, comes from the fact that 
x = for all objects x. 
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6.4. Definition: Reflexive Relations. For all relations R and all sets X: 


R is reflexive on X | iff} Va € X,(a,x) € R|. 


Using infix notation, we can also say that R is reflexive on X iff for all x € X, «Rea. 


Informally, R is reflexive on X iff every object x in X is related to itself under R. This 
means that the digraph for R has a loop at every vertex in the set X. Applying the negation 
rules, we see that | R is not reflexive on X | iff | dae X, (a,x) Z R | 


6.5. Example. Relations Rı and Rg in Figure 6.1 are reflexive on X = {1,2,3}, but the 
other four relations are not. In particular, even though (2,2) € R4, R4 is not reflexive on 
X because (1,1) is not in R4. Similarly, the relation R in Figure 6.2 is not reflexive on 
the set X = {1,2,3,4,5,6} because (5,5) ¢ R. However, R is reflexive on the smaller set 
X’ = {1,2,3,4}, although R is not a relation on X’. Thus, reflexivity is a property of 
both the relation R and the set X. Usually, however, we are dealing with relations on a 
fixed set X, and we are only interested in reflexivity with respect to the full set X. In such 
circumstances, we often say “R is reflexive” as an abbreviation for “R is reflexive on X.” 


6.6. Example. Suppose X is any collection of sets, and R is the relation on X defined by 
(A, B) € Riff A C B, for all A, B € X. This “subset relation” R is reflexive on X, because 
A C A for every set A. 


6.7. Example. For any set X, the equality (or identity) relation Ix = {(x, x) : x € X} is 
reflexive on X. The product set X x X is a relation on X that is reflexive on X. The empty 
set Í is a relation on X that is not reflexive on X, assuming X itself is nonempty. 


6.8. Example. Define relations R = {(z,y) € Z? : x+y is even}, S = {(z,y) € [-1,1]?: 
x? +y? = 1}, and T = {(A, B) € P(Z)? : 8 ¢ A— B}. Determine (with proof) which of 
these relations are reflexive. Solution. First, we show R is reflexive on Z. Fix x € Z; we 
must prove (a,x) € R, which means x + x is even. Since z + x = 2x where x is an integer, 
x +a is even. Second, we show S is not reflexive on [—1, 1]. Pick z = 1 € [1,1], and note 
(z,x) Z S because x? + x? = 2 Æ 1. Third, we show T is reflexive on P(Z). Fix A € P(Z); 
we must prove (A, A) € T, which means 8 ¢ A — A. Since A — A = @ by the Theorem on 
Set Equality, 8 is not in this set. 


Symmetric Relations 


We next generalize the symmetry property of equality, which says that for all objects x,y, 
if x = y then y = zx. 


6.9. Definition: Symmetric Relations. For all relations R: 


R is symmetric | iff | Yz, y, (x,y) € R> (y, z) E Ri. 


Using infix notation, we can also say R is symmetric iff for all z and y, xRy implies yRz. 


Thus, symmetry of R means that whenever x is related to y, y is also related to z. 
Pictorially, this means that for any arrow from x to y in the digraph for R, the reversed 
arrow from y back to x must also appear in the digraph. Applying the negation rules, we 


see that | R is not symmetric | iff | 3x, Jy, (x,y) E RA (y, x) Z R) 


6.10. Example. In Figure 6.1, relations Rı and R4 are symmetric, but the other relations 
are not. For instance, Rə is not symmetric because (1,2) € R2 but (2,1) ¢ R2. The relation 
R in Figure 6.2 is not symmetric because, for instance, (2,6) € R but (6,2) ¢ R. 


278 An Introduction to Mathematical Proofs 


6.11. Example. Let X be the set of all subsets of {1, 2,3}, and let R be the subset relation 
given by aRb iffa C b (for a,b € X). Ris not symmetric, since {1} C {1,2} but {1,2} Z {1}. 


6.12. Example. For any set X, the equality relation Ix is symmetric. X x X is also 
symmetric. The empty relation @ is symmetric, since the implication (x, y) € Ø > (y,z) € 0 
has a false hypothesis and is therefore true. The relation < on R is not symmetric since 
3 < 5 is true but 5 < 3 is false. 


6.13. Example. Decide (with proof) whether the relations R, S, and T from Example 6.8 
are symmetric. Solution. First, we prove R is symmetric. Fix arbitrary (x, y); assume (x, y) € 
R; prove (y,z) € R. We have assumed x,y € Z and x+y is even; we must prove x,y € Z and 
y+za is even. Since x+y = y +x, the conclusion holds. Second, we prove S is symmetric. Fix 
arbitrary (x,y) € R?; assume (x,y) € S; prove (y, x) € S. We have assumed x? +y? = 1; we 
must prove y? + z? = 1. Since z? + y? = y? +2”, this conclusion holds. Third, we prove T 
is not symmetric. Choose A = {1, 2,3} and B = {6,7,8}, which are elements of P(Z). Note 
(A, B) € T, since A — B= A, and 8 ¢ A. But (B, A) g T, since B — A= B, and 8 € B. 


Transitive Relations 


The transitive property of equality says that for all x,y,z, if z = y and y = z, then x = z. 
This leads to the following definition. 


6.14. Definition: Transitive Relations. For all relations R: 


R is transitive | iff | Va, y, z, if (x,y) € R and (y, z) € R, then (z,z) € R|. 


Using infix notation, we can say R is transitive iff for all x,y,z, if xRy and yRz, then zRz. 


Transitivity means that whenever z is related to y and y is related to z, then x must be 
related to z. To detect transitivity from a digraph, we must check that whenever there are 
arrows from z to y to z, there is also an arrow directly from x to z. We must be sure to 


check all cases, including those where x = z. By the negation rules, | R is not transitive | iff 
Jr, y, z, (x,y) E RA (y, z) E€ RA (z,2) € RI. 


6.15. Example. All of the relations in Figure 6.1 are transitive except for R4. We see 
that R4 is not transitive because (1,3) € R4 and (3,1) € R4, but (1,1) ¢ Ry. Relation Re 
is transitive because there are no choices for x,y,z that make the hypothesis (x,y) € Rg 
and (y,z) € Rg true; so the IF-statement defining transitivity is true. The relation R in 
Figure 6.2 is not transitive because, for example, (5,2) € R and (2,6) € R but (5,6) Z R. 


6.16. Example. If X is any collection of sets, the subset relation C on X is transitive since 
AC Band BCC imply A C C. For any set X, the equality relation Ix is transitive, as 
is the relation X x X. The empty set is a transitive relation, by the definition of IF. The 
relation < on R is transitive as well. 


6.17. Example. Decide (with proof) whether the relations R, S, and T from Example 6.8 
are transitive. Solution. First, we show R is transitive. Fix x,y,z € Z, assume (x,y) € R 
and (y,z) € R, and prove (x,z) € R. We have assumed x + y is even and y + z is even, 
which means x +y = 2a and y+ z = 2b for some integers a,b. We must prove x + z is even, 
which means dc € Z, x +z = 2c. Compute x + z = (2a — y) + (2b — y) = 2(a +b — y), where 
a+b-—y €Z. So we may choose c = a + b — y to see that x + z is even. Second, we show 
S is not transitive. Choose x = 0, y = 1, and z = 0. Then (x,y) € S because 0? + 1? = 1; 
(y, z) € S because 1? + 0? = 1; but (x,z) ¢ S because 0? + 0? 4 1. Third, we show T is 
transitive using a proof by contradiction. Assume, to get a contradiction, that there exist 
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A, B,C € P(Z) with (A, B) € T and (B,C) € T and (A,C) ¢ T. By definition, this means 
8 Z A— B and 8 ¢ B-— C and 8 € A— C. The last condition tells us that 8 € A and 8 £ C. 
Since 8 is in A but not in A — B, we must have 8 € B. But now 8 is in B and not in C, so 
8 € B — C. This contradicts the assumption 8 ¢ B — C. 


6.18. Remark. A relation R on a fixed set X may be reflexive or not, symmetric or not, and 
transitive or not. In general, all eight possible combinations can occur (see the exercises). 


Properties of Reflexivity, Symmetry, and Transitivity 
The next theorem lists some facts about reflexivity, symmetry, and transitivity. 


6.19. Theorem on Reflexivity, Symmetry, and Transitivity. For all relations R and 
all sets X: 

(a) R is reflexive on X iff Ix C R. 

(b) R is reflexive on X iff R7! is reflexive on X. 

(c) Any union or intersection of reflexive relations on X is reflexive on X. 

(d) R is symmetric iff R7! is symmetric. 

(e) R is symmetric iff RC R7! if R = Ro. 

(£) Any union or intersection of symmetric relations is symmetric. 

(g) R is transitive iff R~! is transitive. 

(h) R is transitive iff Ro RC R. 

(i) Any intersection of transitive relations is transitive. 

Proof. We prove a few parts here and leave the rest as exercises. Fix a relation R and a 
set X. To prove (b), we first assume R is reflexive on X and prove R`! is reflexive on 
X. Fix x € X; prove (x,x) € R-+. Because R is reflexive on X, we know (2,2) € R. 
Switching the order of the components, we conclude (x, x) € R~! as needed. The converse 
is proved similarly, or it can be deduced from the IF-statement we just proved by replacing 
the arbitrary relation R by R`} and recalling (R~')~! = R. 

Next we prove that the union of any collection of symmetric relations is symmetric (part 
of (f)). Let {R; : i € I} be a collection of relations, and define S = (Jier Ri- Assume R; 
is symmetric for all i € I; prove S is symmetric. Fix objects x,y; assume (x,y) E€ S; prove 
(y,x) € S. We know there exists i € I with (x,y) € Ri. Since R; is symmetric, it follows 
that (y, x) € Ri. By definition of union, (y,x) € S, as needed. 

Finally, we prove that if R is transitive, then Ro R C R (part of (h)). We have assumed 
Yz, y, z, (x,y) E€ RA (y,z) E€ R => (a,z) € R. To prove Ro R C R, fix an ordered pair 
(u,v) € Ro R and prove (u,v) € R. By definition of composition of relations, there exists 
w with (u, w) € R and (w,v) € R. By transitivity of R, (u,v) € R follows. 


6.20. Example. It is false that the union of transitive relations must always be transitive. 
For example, let R = {(1,2)} and S = {(2,3)}, so RUS = {(1, 2), (2,3)}. Both R and S$ 
are transitive relations, but their union is not. 


Section Summary 


1. Relations and Digraphs. R is a relation on a set X iff R C X x X. In this case, 
the digraph of R consists of a point for each x € X, and an arrow from z to y for 
each ordered pair (x,y) € R. 


Reflexive Relations. R is a reflexive relation on X iff for all x € X, (a,x) € R. 


3. Symmetric Relations. A relation R is symmetric iff for all a,b, if (a,b) € R, then 
(b,a) E€ R. 
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4. Transitive Relations. A relation R is transitive iff for all a,b,c, if (a,b) E€ R and 
(b,c) E R, then (a,c) E€ R. 


5. Infix Notation. Given a relation R, aRb means (a,b) € R. In this notation, R is 
reflexive on X iff Vx € X,aRa; R is symmetric iff Va,b,aRb => bRa; and R is 
transitive iff Va, b, c, (aRb A bRc) > aRc. 


6. Properties of Reflexivity, Symmetry, and Transitivity. If R is reflexive on X, or 
symmetric, or transitive, then R71 has the same property. Reflexivity, symmetry, 
and transitivity are inherited by intersections of relations. Reflexivity and sym- 
metry are inherited by unions of relations, but transitivity is not. R is reflexive 
on X iff Ix C R; R is symmetric on X iff R C Ro! iff R= Rt; R is transitive 
on X iff RoRCR. 


Exercises 


1. The digraphs of various relations are shown in Figure 6.3. Is each relation reflexive 
on X? symmetric? transitive? 


FIGURE 6.3 
Digraphs for some relations. 


2. Draw an arrow diagram and a digraph for each relation. Is each relation reflexive 
on X = {1,2,3,4}? symmetric? transitive? 
a) R= {(1,1), (2, 2), (3, 3), (4, 4), (1, 3), (3, 4), (1, 4), (2, 4)}. 
b) R = {(1, 2), (2, 1), (3, 4), (4,3)}. 
c) R= (X x X)— Ix. 
d) R = {(1,1), (3,3), (1,3), (3, 1), (2, 2), (4, 4), (2, 4), (4, 2)}. 
3. Is each relation reflexive on Z? symmetric? transitive? Explain. 
a) R={(a,b) € Z? :a <b}. 
b) R= {(a,b) € Z? : a? < b?}. 
R = { (a,b) € Z? : ab is even}. 
d) R = {(a,b) € Z? : |a — b| < 1}. 
R = {(a,b) € Z? : a divides b}. 


Equivalence Relations and Partial Orders 281 


(£) R = {(a,b) € Z? : 5 divides ab}. 
(g) R={(a,b) € Z? : 7 divides a — b}. 

4. Let S be a relation on R. (a) Describe how to determine if S' is reflexive on R by 
visual inspection of the graph of S in the xy-plane. (b) Similarly, describe how 
to determine if S' is symmetric by inspection of its graph. 


5. Is each relation reflexive on R? symmetric? transitive? Explain. 
a) R={(z,y) € R? : cosx = cosy}. 
b) R= mc y) € R®: ay = 1}. 
y) ER? :y=aeVy=—2Vart+y? =1}. 
y) E€ R? : zy < 0}. 
(x,y) € R? : zy > 0}. 
(x,y) € R? : xy > 0} U {(0,0)}. 
= (9) ER?: aaa 


ati 

R(c,d) means a 24 52 =?4+d*. 

R(c, d) means ad = be. 

R(c,d) means a +d = b +c. 

4) a tae d) means a < c or (a = c and b < d). 

7. How can you decide by looking at the digraph of a relation G on X whether G is 
the graph of a function f : X > X? 


8. Let X = {a,b,c}. For each part, give two different examples of relations on X 
with the indicated properties (if possible). 
a) R is reflexive on X and symmetric and transitive. 
b) R is reflexive on X and symmetric and not transitive. 
c) R is reflexive on X and not symmetric and transitive. 
d) R is reflexive on X and not symmetric and not transitive. 
e) R is not reflexive on X and symmetric and transitive. 
f) R is not reflexive on X and symmetric and not transitive. 
g) R is not reflexive on X and not symmetric and transitive. 
h) R is not reflexive on X and not symmetric and not transitive. 


9. Which parts of the previous exercise can be solved (with either one or two exam- 
ples) using relations on the two-element set X = {1,2}? 

10. Prove (a) and (c) of Theorem 6.19. 

11. Prove (d), (e), and the second part of (f) in Theorem 6.19. 

12. Prove (g), (i), and the second part of (h) in Theorem 6.19. 

13. (a) Suppose R is a reflexive relation on a nonempty set X. What can you say 
about whether (X x X)— R is reflexive on X? (b) Repeat (a) replacing “reflexive” 
by “symmetric.” (c) Repeat (a) replacing “reflexive” by “transitive.” 

14. (a) If R and S are reflexive relations on X, must Ro S be reflexive on X? (b) If 
R and S are symmetric relations, must R o S be symmetric? (c) If R and S are 
transitive relations, must Ro S be transitive? 

15. Prove: for any relation R and any set X, RU R7! U Ix is reflexive on X and 
symmetric. 

16. (a) For arelation R on X, prove that the intersection T of all transitive relations 
on X containing R is a transitive relation on X containing R. (b) Recursively 
define R! = R and R"*! = R” o R for n € Zs}. Prove: for all n,m € Z>1, 
R”+™ = R” o R”. (c) Prove that T = UZ] R 
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6.2 Equivalence Relations 


In mathematics, it often happens that two objects are equivalent in a certain respect, even 
though the objects in question are not equal to each other. For instance, two triangles 
(thought of as subsets of R?) can be geometrically congruent even if they are not equal 
as sets of points. As further examples, two unequal shapes may be geometrically similar; 
two unequal vectors may have the same length; two unequal lines may have the same 
slope; two unequal times (differing by a multiple of 24 hours) may represent the same hour 
of the day; two unequal real numbers (differing by a multiple of 27) may represent the 
same geometric angle; and so on. The abstract definition of an equivalence relation gives 
a precise mathematical formulation of various intuitive notions of equivalence. We define 
equivalence relations in this section and consider many examples, including congruence of 
integers modulo n and the equivalence relation induced by a function. 


Definition and Examples of Equivalence Relations 


Any intuitive concept of “equivalence” ought to share some of the basic properties of logical 
equality. It turns out that reflexivity, symmetry, and transitivity are the key properties we 
need. 


6.21. Definition: Equivalence Relations. Given a relation R on a set X, 


R is an equivalence relation on X | iff| R is reflexive on X, symmetric, and transitive |. 


Expanding this definition, we see that R is an equivalence relation on X iff the following 
three conditions hold for all a,b,c € X: (i) aRa; (ii) if aRb then bRa; (iii) if aRb and bRc, 
then afc. 


6.22. Example: Equality Relation. For any set X, the identity relation Ix = {(x,2) : 
xz € X} is an equivalence relation on X. If we write x = y instead of Ixy, the three 
conditions above become: (i) a = a; (ii) if a = b, then b = a; (iii) if a = b and b = c, then 
a=c. 


6.23. Example. For any set X, R= X x X is an equivalence relation on X. In this case, 
«Ry is true for all pairs of objects x,y € X. So it is immediate that conditions (i), (ii), and 
(iii) hold for this R. 


6.24. Example. Let X = Z and R = {(z,y) € Z? : x + y is even}. We saw in the last 
section that R is reflexive on X, symmetric, and transitive. Thus, R is an equivalence 
relation on X. On the other hand, let S = {(x, y) € Z? : x+y is odd}. S is not reflexive on 
Z, since (1,1) S, so S is not an equivalence relation on Z. 


6.25. Example. Define T = {(z, y) € R? : xy > 0}. You can check that T is reflexive on R 
and symmetric. But T is not transitive, because (1,0) € T and (0,—1) € T but (1,-1) g T. 
So T is not an equivalence relation on R. 


6.26. Example. Figure 6.4 shows the digraphs for six relations Ri,..., Rg on the set 
X = {1,2,3,4,5,6}. We see from inspection of the figure that R1, Ro, and Rg are equivalence 
relations on X, but R4, Rs, and Rg are not. R4 is not symmetric because (2,3) € R4 but 
(3,2) ¢ Ry. Rs is not reflexive on X because (5,5) ¢ Rs. Rg is not transitive because 
(1,3) € Re and (3,6) € Re, but (1,6) Z Re. 

In Figure 6.1 from the previous section, only R; (the equality relation) is an equivalence 
relation on X = {1,2,3}. Digraphs of equivalence relations have special structural features 
that we will explore later, when we discuss equivalence classes and set partitions. 
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FIGURE 6.4 
Digraphs for some equivalence relations and non-equivalence relations. 


Equivalence Relation Induced by a Function 


Given a function f : X — Y, we can create an equivalence relation on the domain of f by 
the following construction. 


6.27. Definition: Equivalence Relation Induced by a Function. Given f: X >Y, 
the equivalence relation induced by f is Rr = {(a,b) E X x X : f(a) = f(b)}. We often 


denote Ry by the symbol ~,;; thus, for all a,b E€ X,|a~, b| means | f(a) = f(b) | 


Let us check that ~p always is an equivalence relation. Fix an arbitrary function f : 
X > Y, and fix a,b,c € X. We have a ~p a because f(a) = f(a), so ~p is reflexive on 
X. To prove symmetry, assume a ~s b and prove b ~+ a. We have assumed f(a) = f(b); 
we must prove f(b) = f(a); this follows since equality is symmetric. To prove transitivity, 
assume a ~p b and b ~ș c; prove a ~p c. We have assumed f(a) = f(b) and f(b) = f(c); 
we must prove f(a) = f(c). This follows since equality is transitive. We have now proved 
~p is an equivalence relation on X. 

By choosing different functions f, we can use this construction to build a wide variety 
of equivalence relations. (In fact, we will see later that all equivalence relations arise in this 
way for an appropriate choice of f.) Here are some typical examples. 


6.28. Example. For f : R > R given by f(x) = |x|, we get an equivalence relation ~, 
on R such that x ~p y iff |x| = |y|. So, for instance, —3 ~s 3, 4 ~f 4, 0 ~p 0, but 
—3 %ş 5. Intuitively, this relation regards two real numbers as equivalent iff they have the 
same magnitude (disregarding sign). The function g : R > R given by g(x) = x? induces the 
same equivalence relation as f. For, given any x,y € R, £ ~y y iff a? = y? iff Va? = yy? 
iff || = |yl if z ~y y. 


6.29. Example. Define sgn : R > {—1,0, 1} by setting sgn(x) = —1 if x < 0, sgn(x) = 0 
if x = 0, and sgn(#) = 1 if x > 0. The equivalence relation ~sgn regards two real numbers 
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as equivalent iff they have the same sign. For example, —3 ~sgn —7, 0 ~sgn 0, 2 ~sgn 7, but 
2 PE sen —2. 


6.30. Example. Define f : R > R? by f(t) = (cost, sint) for all t € R. From trigonometry, 
we know that for all s,t € R, f(s) = f(t) iff s = t + 27k for some integer k. Intuitively, ~p 
regards two real numbers as equivalent iff those numbers represent the same angle going 
around the unit circle. The individual sine and cosine functions induce more complicated 
equivalence relations on R. For instance, 7/3 ~sin 27/3 and 1/3 ~cos —7/3, although no 
two of the numbers —7/3, 1/3, 27/3 are related under ~,. 


6.31. Example. Define f : Z>o > {0,1,...,9} by letting f(n) be the rightmost digit in 
the decimal representation of n. For instance, f(5247) = 7, f(10? — 1) = 9, and f(10k) = 0 
for any k € Zyo. The relation ~ş regards two positive integers as equivalent iff they have 
the same last digit. 


Congruence Modulo n 


We now introduce a family of equivalence relations that play a fundamental role in number 
theory. 


6.32. Definition: Congruence Modulo n. For each positive integer n > 1, define a 


relation =, on Z as follows: for all a,b € Z, |a =n b| iff|n divides a — b |. When a =, b, we 


also use the notation | a = b (mod n) |, which can be read “a is congruent to b modulo n.” 


The next theorem says that congruence modulo n is an equivalence relation that is 
compatible with the operations of integer addition and multiplication. 


6.33. Theorem on Congruence Modulo n. For all n € Z> and all a,b,c,d € Z: 
(a) =, is an equivalence relation on Z. 

(b) Ifa =, b and c=, d, then a +c =n b+d. 

(c) If a =n b and c =n d, then ac =n bd. 


Proof. For (a), we must show =, is reflexive on Z and symmetric and transitive. To prove 
=, is reflexive on Z, fix a € Z. We prove a =, a, which means n divides a — a. Since 
a—a=0=0n where 0 is an integer, n does divide a — a. To prove symmetry, fix a,b € Z; 
assume a =n b; prove b =n a. We have assumed n divides a — b, so a — b = kn for some 
integer k. We must prove n divides b — a. Since b — a = —(a — b) = —(kn) = (—k)n where 
—k is an integer, n does divide b — a. To prove transitivity, fix a,b,c € Z; assume a =n b 
and b =n c; prove a =n c. We have assumed n divides a — b, so a — b = dn for some integer 
d; and n divides b—c, so b — c = en for some integer e. We must prove n divides a—c. Note 
a—c=(a+b)+(b—c) = dn + en = (d + e)n where d + e is an integer; so n does divide 
a—c. 

We prove (c), leaving (b) as an exercise. Assume a =,, b and c =n d, so a — b = jn and 
c— d = kn for some j,k € Z. We must prove ac =n bd, i.e., ac — bd = pn for some p € Z. 
Compute 


ac — bd = ac — ad + ad — bd = a(c — d) + (a — b)d = a(kn) + (jn)d = (ak + jd)n. 


Choosing p = ak + jd, which is an integer, we then have ac — bd = pn as needed. 


6.34. Example. When n = 2, a => b iff 2 divides a — b. By considering cases, we see that 
for all a,b € Z, a =2 b iff a and b are both even, or a and b are both odd. Thus, =2 regards 
two integers as being equivalent iff these integers have the same parity (odd or even). 
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6.35. Example. Taking n = 10, we have 5247 =10 7, 115 =10 9015, but 4441 415 4444. 
For all positive integers a,b, we have a =9 b iff a and b have the same last digit written in 
decimal iff a ~p b, where f is the function from Example 6.31. However, we must be careful 
when negative integers appear. For instance, 17 =19 —3 since 10 divides 17 — (—3) = 20, 
but 17 #19 —7 since 10 does not divide 17 — (—7) = 24. 


Further Examples of Equivalence Relations 


When considering an equivalence relation (or any relation) on a set X, the objects in X 
can be anything. In the next example, the objects are ordered pairs (points or vectors in 
the xy-plane), so that the relation itself is a set of ordered pairs where each component is 
an ordered pair. In this situation, infix notation greatly improves readability. 


6.36. Example. Let X = R x R. Define a relation R on X by setting (a, b) R(c, d) iff there 
exists r € Ryo with c= ra and d = rb. We now prove that R is an equivalence relation on 
X. For reflexivity, fix (a,b) € X. To prove (a,b)R(a,b), we must prove dr € Ryo, a = ra 
and b = rb. Choosing r = 1, these equations do hold. For symmetry, fix (a,b), (c,d) € X, 
assume (a,b)R(c,d), and prove (c,d)R(a,b). By assumption, there is r € Ryo with c = ra 
and d = rb. We must prove Js € Ryo, a = sc and b = sd. Choose s = r—!, which exists and 
is a positive real number since r is positive. The assumptions c = ra and d = rb become 
a=r-'e=scandb=r'd = sd, as needed. For transitivity, fix (a,b), (c,d), (e, f) € X, 
assume (a,b)R(c,d) and (c,d)R(e, f), and prove (a,b) R(e, f). By assumption, there exist 
t,u E€ R>o with c= ta, d= tb, e = uc, and f = ud. Combining these equations, we see that 
e = uc = u(ta) = (ut)a and f = ud = u(tb) = (ut)b, where ut € Ryo since ut is a product 
of two positive real numbers. We now see that (a, b)R(e, f), as needed. 


In the next example, the elements of X are functions. 


6.37. Example. Let X be the set of all functions f : R —> R. Define a relation R on 
X by setting fRg iff f(3) = g(3) for all f,g € X. It is routine to prove that R is an 
equivalence relation by verifying the definition. Alternatively, we can deduce this using a 
previous construction, as follows. Define an “evaluation function” E : X — R by setting 
E(f) = f(8) for all f € X. Note that E is a function that takes another function as its 
input and produces a real number as its output. For all f,g € X, fRg iff f(3) = g(3) iff 
E(f) = E(g) iff f ~m g. Thus R is the equivalence relation induced by the function E. 


In the next example, the elements of X are sets. 


6.38. Example. Let X = P(Z), and for A, B € X, define A B iff AN Z>o = BN Zyo. 
The relation ~ is an equivalence relation on X. In fact, ~ is the equivalence relation ~ 
induced by the function f : X + P(Z) such that f(A) = AN Zso for all A € X. 


We now give a method for obtaining a new equivalence relation from existing equivalence 
relations. 


6.39. Lemma on Intersection of Equivalence Relations. For all sets X, the inter- 
section of any nonempty set of equivalence relations on X is an equivalence relation on 
X. 


Proof. Fix a set X, and let {R; : i € I} be a nonempty set of equivalence relations on X. 
We must prove R = fje; R; is also an equivalence relation on X. Since every R; is reflexive 
on X, the intersection R is also reflexive on X by Theorem 6.19. Similarly, by the same 
theorem, R is symmetric since all R; are symmetric, and R is transitive since all R; are 
transitive. So R is an equivalence relation on X. 
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6.40. Example. Given X = Z, we claim that =3 N =5 equals =15. Writing Rn for the 
relation =,, we are claiming that the two sets of ordered pairs R3 N Rs and Ry; are equal. 
[We prove the set equality using a chain proof, observing that all members of both sets are 
ordered pairs of integers.] Fix an arbitrary pair (a,b) € Z x Z. Note that (a,b) € R3 N Rs iff 
(a,b) € R3 and (a,b) € Rs iff 3 divides a—b and 5 divides a—b iff 15 divides a—b iff (a,b) € 
Ri5. The next-to-last equivalence can be proved by considering the prime factorization of 
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a — b. More generally, you can check that for all m,n € Z>0, =m N =n equals =jcm(mjn)- 


Section Summary 


1. 


Equivalence Relations. R is an equivalence relation on a set X iff R is a relation 
on X that is reflexive on X, symmetric, and transitive. The intersection of any 
nonempty set of equivalence relations on X is also an equivalence relation on X. 


Equivalence Relation Induced by a Function. Given any function f : X > Y, 
there is an induced equivalence relation ~ on X defined by a ~, b iff f(a) = f(b) 
(where a,b € X). 


Congruence mod n. For each n > 1, the relation =, defined by a =n b iff n divides 
a — b (for a,b € Z) is an equivalence relation on Z. This relation is compatible 
with addition and multiplication: for all a,a’,b,b! € Z, if a =, a’ and b =n U, 
then a+b =, a’ +0! and ab =, a'b'. When a =n b, we also write a = b (mod n). 


Exercises 


ils 


(a) Which relations shown in Figure 6.3 are equivalence relations on X? (b) Re- 
peat (a) replacing each relation R; by R; U Ix. 


Let X = {1,2,3,4,5,6}. For each function f : X — {a,b,c,d,e}, describe the 
equivalence relation ~ş as a digraph and as a set of ordered pairs. 

(a) f(x) = c for alla € X. 

(b) f(x) =a for odd x € X, f(x) =e for even x € X. 

(c) f(1) =b, f(2) =e, f(3) =a, f(4) =b, f(5) =e, f(6) = 

(d) F1) =a, f(2) =b, f(3) =c, f(4) =d, f(5) =e, f(6) = 

Let X = {1, 2,3,4, 5,6,7,8}. It is known that R is an equivalence relation on X 
containing these ordered pairs: (2,4), (5,6), (1,3), (4,6), (8,3). Find the relation 
R (satisfying these conditions) such that R has the smallest possible number 
of elements. Display R as a set of ordered pairs and as a digraph. [Hint: Since 
(2,4) € Rand Ris symmetric, R must also contain (4, 2). Make further deductions 
of this kind to see what other ordered pairs must be in R.] 


Draw the digraph of the smallest equivalence relation R on the set X = 
{1,2,3,4,5,6,7,8} such that R contains the ordered pairs (1,3), (1,5), (1,7), 
(2,2), and (8, 6). 

Show that each relation R is an equivalence relation on X by finding a function 
f:X >Y such that R is ~p. 


= R?, (a,b) R(c, d) iff (a,b) and (c,d) lie on the same vertical line. 
= R?, (a,b) R(c, d) iff (a,b) and (c,d) lie on the same circle centered at the 
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origin. 
(£) X = R20, (a,b)R(c,d) iff (a,b) and (c,d) lie on the same line through the 
origin. 
(g) X =R, «Ry iff x,y € R belong to the same interval (n,n + 1] with n € Z. 

6. (a) Find all b € Z such that 7 =o b. (b) Find all b € Z such that 7 =; b. (c) Find 
all b € Z such that 7 =10 b. 


7. Find all n such that 111 =,, 471. 
8. Prove part (b) of the Theorem on Congruence Modulo n. 
9. Fix n> 1. (a) Prove: for all a,b € Z, if a =, b, then —a =n —b. (b) Prove: for all 
a,b,c,d € Z, if a=, b and c=, d, then a — c =n b — d. 
10. Prove by induction on k: for all n € Zs; and all k € Z>o and all a,b € Z, if a = b 
(mod n), then a! = b* (mod n). 
11. Draw the digraphs of all equivalence relations on the set X = {1, 2,3}. 


12. Give a specific example to show that the union of two equivalence relations on 
the set {1,2,3,4} need not be an equivalence relation on this set. 


13. Let X = Z x Z. Define a relation R on X by setting (a, b)R(c, d) if a+d = b +c. 
Prove that R is an equivalence relation on X. 


14. Let X = Z x Zyo. Define a relation R on X by setting (a, b)R(c, d) iff ad = be. 
Prove that R is an equivalence relation on X. Would this still be true if we had 
used X = Z x Z? 


15. Define a relation ~ on R” by setting v ~ w iff there exists r € Rzo with w = rv 
(i.e., wi = rv; for i = 1,2,...,n). Prove that ~ is an equivalence relation on R”. 
Would this still be true if we had used r € R instead of r € Rzo? 


16. Given sets A and X with A C X, let R = (A x A)U((X — A) x (X — A)). Prove 
R is an equivalence relation on X. 


17. Let S be any relation from X to Y. Define a relation ~s on X by setting (for 
a,b E€ X) a ~s b iff S[{a}] = S[{b}]. Is ~s an equivalence relation on X? 

18. Fix n > 1, and define f : Z > {0,1,...,n — 1} by letting f(a) be the unique 
remainder in {0,1,...,n — 1} when a is divided by n. Prove that =, is the 
equivalence relation ~f. 


19. Let X be any collection of functions with a common domain D. Let S be a fixed 
subset of D. For f,g € X, define f ~ g iff f(s) = g(s) for all s € S. Prove ~ is 
an equivalence relation on X. 


20. Let X be any collection of sets. For A,B € X, define A ~ B iff there exists a 
bijection f : A > B. Prove ~ is an equivalence relation on X. 


21. (a) Given functions f : X + Y and g : Y — Z, prove that ~ + is a subset of ~gof. 
(b) Prove that if g is injective, then ~» equals ~gof. 


22. Write Rm for the relation =m. Given positive integers n1,...,nx%, prove that 
REA Rn; = Ry, where N = lem(n1,..., np). 


23. Given any relation S on a set X, let R be the intersection of all equivalence 
relations on X containing S. (a) Show that R is an equivalence relation on X 
containing S, such that R C R’ for all equivalence relations R’ on X containing 
S. (b) Let T = U7, (SUS! U Ix)” (this notation is defined in Exercise 16 of 
86.1). Prove R= T. 
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6.3 Equivalence Classes 


Intuitively, an equivalence relation R on a set X lets us “clump together” all the objects 
in X that are the same in some way, creating a collection of subsets of X. For instance, 
congruence modulo 2 decomposes the set of integers into the subset of odd integers and the 
subset of even integers. The subset of X consisting of all objects in X related under R to 
a given object b is called the equivalence class of b determined by R. We can gain a lot of 
insight into an equivalence relation by studying its equivalence classes. We consider many 
examples of equivalence classes in this section. We then show that any two equivalence 
classes of R are either equal or do not overlap at all, so that every element of X belongs to 
exactly one equivalence class. 


Definition and Examples of Equivalence Classes 


Consider again the digraphs of the three equivalence relations shown in the top row of 
Figure 6.4, which are redrawn in Figure 6.5 below. We notice that the arrows in the digraph 
divide the points of X into different clumps, where all pairs of points in a given clump are 
connected by relation arrows, but no arrow goes between points in two different clumps. 
The next definition helps us understand this situation better by giving names to the clumps 
— they are the equivalence classes of the given equivalence relation. 


6.41. Definition: Equivalence Classes. Let R be an equivalence relation on a set X. 
For all b € X, the equivalence class of b determined by R, denoted fb]r, is the set of all 
objects related to b by R. In symbols, 


ble = {x € X : (a,b) € R}. 


Since R is symmetric, we can restate this definition in infix notation as follows: 


for all b,x € X, |x € [b]r|&|(x,b) E€ R| &|zRb]|& |bRz]. 


If R is understood from context, we may write |b] to abbreviate [b]rR. Given any set S, 


S is an equivalence class of R | iff | db € X, S = [b] r 


6.42. Example. Let us compute the equivalence classes for the equivalence relations shown 
in Figure 6.5. For R,, we find that 


[Mr = {1,4}, []r, = {2,3}, Bla, = {2,3}, Kr = {1,4}, Bla, = {5,6}, [6]r, = {5, 6}. 


Notice that [l1]r, = [4]r,, [2]r, = [3] r,, and [5)r, = [6]r,. In general, for an equivalence 
relation R, it atten happens that a # b and yer la alr = b |r. This means that a given 
equivalence class S may have several different names |a]r for various choices of a € X. In 
fact, we will prove shortly that for all a,b € X, |a]r = [b]r iff aRb. In the case of Rj, there 


are only three distinct equivalence classes, specifically {1,4} and {2,3} and {5,6}, but each 
equivalence class has two different names. The fact that a given equivalence class may have 
multiple names is a confusing but fundamental feature of this concept. 

Turning to R2, inspection of the digraph shows that 


(Ir. = {1,2} = [2] rz, [3]r. = {3}, [4] r = {4, 5, 6} = [5] 2 = [6] r2- 
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FIGURE 6.5 
Digraphs for some equivalence relations. 


We see that there are three distinct equivalence classes of R2, namely {1,2} and {3} and 
{4,5,6}. The class {1,2} has two members and two names; the class {3} has one member 
and one name; the class {4, 5,6} has three members and three names. In general, if S is an 
equivalence class of an equivalence relation R, each member a € S' gives us a possible name 
(alr for the equivalence class S. For this reason, each a € S is called a representative of the 
equivalence class S. 

Finally, Rs has three equivalence classes: [2)r, = {2}, [5]r, = {5}, and [1]r, = 
{1, 3,4, 6} [3] rs [4] Rs [6] rs- 


6.43. Example. For any set X, consider the equality relation Ix = {(x,£z): x € X}. 
For all a,x € X, Iya holds iff z = a, so that [a];, = {a}. In this case, each equivalence 
class consists of a single element. Intuitively, the relation of logical equality never clumps 
together two unequal objects. (See the first digraph in Figure 6.1.) At the other extreme, 
consider the equivalence relation R = X x X. For all a,x € X, xRa is true, so |a]r = X for 
all a € X. In this case, there is a single equivalence class consisting of every element of X. 
This equivalence relation clumps together all objects in X. 


Equivalence Classes of ~; 


We saw in §6.2 that for any function f : X — Y, there is an associated equivalence relation 
~p on X such that for all a,b € X, a ~p biff f(a) = f(b). So for each a € X, the equivalence 
class of a is 


[a]~, ={xE€X:x~xfa}={xE X: f(x) = f(a)} = FHS (a)}. 


This equivalence class is sometimes called the fiber of f over f(a). The figure in the next 
example explains this terminology. 


6.44. Example. Let X = {1,2,3,4,5,6}, Y = {r,s,t,u}, and define f : X — Y by 
f(1) = £ (2) = f (3) = s, f(4) = f (6) = t, and f(5) = u. Figure 6.6 shows the arrow diagram 
for f and the digraph for ~y in a single picture. The arrows leading into a given point in the 
image of f resemble threads in a fiber starting at that point and leading back into the domain 
of f. The distinct equivalence classes of ~, are [1], = {1,2,3} = 2]., = [3]., = f'Hs}, 
[4]~, = {4,6} = [6]., = f+ [{t}], and [5])., = {5} = f Hu}. As before, we can specify an 
equivalence class by listing its elements or by picking a representative b and using the name 
[b]~,. In this situation, we can also use fiber notation for an equivalence class, e.g., denoting 
{1,2,3} as f~1[{s}]. The fiber f~1[{r}] is the empty set, which is not an equivalence class. 
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arrow diagram for f 


digraph for ~ ¢ 


FIGURE 6.6 
A function f and its associated equivalence relation ~p. 


6.45. Example. For f : R > R given by f(x) = |z|, the equivalence classes (fibers) 
are [z]~, = {x,—2x} for each x € R. For example, [3]~, = {3,—-3} = [-3]~,, [-7]~,; = 
{-7,7} = [7]~,, and [0|., = {0, —0} = {0}. The function g : R > R given by g(x) = x” has 
the same induced equivalence relation and hence the same equivalence classes. The function 
sgn : R > R has three distinct equivalence classes: the set Reo of negative numbers, the 
set {0}, and the set Rso of positive numbers. For instance, [—8] = [-0.001} - 


[-27]~... = Reo. 
6.46. Example. Define f : R > R? by f(t) = (cost,sint) for t € R. By trigonometry, 
f(s) = f(t) iff s = t+ 27k for some integer k. Therefore, 

tl~; ={s ER: f(s) =f®}={s eR: 3k € Zs =t + Ik}. 


For instance, [0]~, = {0,+27,+47,...} = [27]~,, and [1/3]., = {2/3, 71/3, 137/3,..., 
—5r/3, —117/3,...}. In this example, each equivalence class has infinitely many members 
(representatives) and hence infinitely many names. 


“sgn 


More Examples of Equivalence Classes 


6.47. Example. Fix n > 1, and consider congruence modulo n on the set Z. For brevity, 
let us write [a], instead of [a]=,. For each a € Z, we have 


Ian = {£ €Z: x=, a} = {x € Z:n divides z — a} = {x € Z : Jk e€Z,4-—a=kn} 
{a €Z:5ke€Z,x=at+kn}={a+kn:k eZ}. 


So, for example, [0]; = {0,+5,+10,+15,...} = [5]s = [10]; = [—15]5 = [5k]s for any k in 
Z; [l]a = {1,5,9,13,...,-3,-7,-11,...}; [77710 = {7, 17, 27, 37,...,-3, —13, —23,...} = 
[7]10, and so on. When n = 2, the relation => has two distinct equivalence classes, the set 
of even integers and the set of odd integers. We can describe these sets as [O]2 and [1]2, 
respectively. More generally, we will prove later that =, has exactly n distinct equivalence 
classes, which can be named as [0]n, [1]n,---, [n — 1]n. The set of these n equivalence classes 
is called the set of integers modulo n. We discuss the algebraic structure of the integers 
modulo n in §6.6. 


6.48. Example. Let X =R x R. For all (a,b), (c,d) € X, define (a,b) R(c, d) iff a? + b? = 
c? +d’. R is the equivalence relation induced by the function f : X — R given by f(a,b) = 
242. The equivalence class of (3, 4) € X is [(3, 4)]r = {(z, y) € X : a? +y? = 3+4? = 25}, 


Equivalence Relations and Partial Orders 291 


which is the circle of radius 5 centered at the origin. More generally, for any ao, bo € R, 
[(ao, bo)] = {(a,y) E X : £? + y? = af + bê} is the circle of radius ro = yag + b? centered 
at the origin. A degenerate case occurs at the origin: [(0,0)]r = {(0,0)}. Note that every 
point in the plane belongs to exactly one circle centered at the origin. So the full set X has 
been decomposed into a union of equivalence classes (the circles), where any two unequal 
equivalence classes are disjoint (have empty intersection). 

Now consider another relation S on X, defined by (a, b)S(c, d) iff there exists r € Ryo 
with c = ra and d = rb. We saw in the last section that S is an equivalence relation on 
X. The equivalence class of (3,4) € X determined by S is [(3,4)]5 = {(8r,4r) : r € Ryo}, 
which is the ray from the origin through the point (3, 4) [excluding the origin itself]. More 
generally, each equivalence class [(ao, bo)|g is a ray starting at (but excluding) the origin 
and passing through (ao, bo). As before, the origin is a special case: [(0, 0)|5 = {(0,0)}. Once 
again, X has been decomposed into a union of equivalence classes (the rays, along with the 
origin), where any two equivalence classes are either disjoint or equal. 


6.49. Example. Let X be the set of subsets of {1,2,3}, and define a relation ~ on X by 
letting A ~ B iff A and B have the same number of elements. Then ~ is an equivalence 
relation on X with the following equivalence classes: [Ø] = {0}; 


[{1}]~ = {{1}, {2}, {33} = 23]. = L3H; 
[{1,2}]~ = {{1, 2}, {1, 3}, {2,3}} _ [{1,3}]~ = [{2, 3}]~; 


and [{1,2,3}]~ = {{1, 2,3}}. Here too, every member of X belongs to exactly one equiva- 
lence class, and two distinct equivalence classes do not share any common members. 


6.50. Example. Let X be the set of all functions f : R —> R, and for all f,g € X, define 
f ~ g iff f(3) = g(3). The equivalence class of the squaring function s (given by s(x) = x? 
for x € R) is [s] ~ = {f € X : f(3) = 5(3)} = {f € X : f(3) = 9}. The equivalence class of 
the identity function is [Idg]~ = {f € X : f(3) = 3}. The equivalence class of a constant 
function fe with constant value c € R is [f.]~ = {f € X : f(3) = c}. In this example, each 
equivalence class contains exactly one constant function, which could be singled out as a 
particularly natural choice of a representative for that equivalence class. The constant tells 
us the value of f at input 3 for all functions f in the equivalence class. 


Properties of Equivalence Classes 


The following theorem lists some key features of equivalence classes illustrated by the pre- 
ceding examples. 


6.51. Theorem on Equivalence Classes. Let R be an equivalence relation on a set X. 


(a) When Equivalence Classes are Equal: For all a,b € X, the following five conditions are 
equivalent: [alr = |[b]r; b € [a]r; bRa; aRb; a € [b|r. 

(b) Disjointness of Equivalence Classes: For all equivalence classes S, T of R, if S 4 T, 
then SMT =. So two equivalence classes are either disjoint or equal. 


(c) Partitioning Property: For all a € X, there exists a unique equivalence class S of R with 
a € S, namely S = [a]r. Every equivalence class S is nonempty. 


Proof. (a) Fix a,b € X. We give a circle proof that the indicated conditions are equivalent. 
First assume [a] = [b]r. We know bRb since R is reflexive on X, so b € [b]r by definition. 
As the sets [a] r and |b] g are equal, we see that b € [a] r. Next, assume b € [a] r. By definition 
of equivalence class, our assumption means that bRa. Next, assume bRa. Then aRb because 
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R is symmetric. Next, assume aRb. Then a € [b]r by definition. Finally, assume a € [b]pr, 
and prove [alr = [b|r. Our assumption means aRb, and hence bRa by symmetry. Let us 
prove [alr C [b]r. Fix c; assume c € [a]r; prove c € [b]r. We have assumed cRa and must 
prove cRb. Using cRa, aRb, and transitivity, we deduce cRb, as needed. Now we prove 
[blr C [a]r. Fix d; assume d € [b] rR; prove d € [a]r. We have assumed dRb and must prove 
dRa. Since dRb and bRa are known, transitivity gives dRa, as needed. We have now shown 
that [alr = [b] r. 

(b) Fix two equivalence classes S,T of R. Using a contrapositive proof, we assume 
SAT #9 and prove S = T. By definition of equivalence class, there exist a,b € X with 
S = [alr and T = [b)r. Our assumption means that there exists c € SMT. Thus, c € [a]r 
and c € [b]r, so that cRa and cRb by definition. Now symmetry gives aRc, and transitivity 
gives aRb. By part (a), we conclude that [a]r = [b]r, which means that S = T. 

(c) Fix a € X. By reflexivity, aRa, and hence a € [a] rR. Thus, S = [a] p is an equivalence 
class of R containing a. To prove uniqueness, suppose T and U are any two equivalence 
classes of R with a € T and a € U; we must prove T = U. This follows from part (b), since 
a € TANU and hence TAU 4 9. Finally, any equivalence class V has the form V = fb]r for 
some b € X. Since b € [b]r as remarked earlier, V must be nonempty. 


Regarding part (c) of the theorem, note that each a € X belongs to a unique equivalence 
class of R; but the name of this equivalence class is usually not unique. Part (a) of the 
theorem tells us all possible names of a given equivalence class: [b]R is another name for 
the set [a] iff [b]r = [alr iff aRb iff b € [a]r. This confirms our earlier remark that each 
representative (member) of a given equivalence class provides one of the possible names for 
this equivalence class. 


Section Summary 


1. Equivalence Classes. Given an equivalence relation R on a set X and given b € X, 
the equivalence class of b determined by R is [b]r = {x € X : Rb}. So z € [blr 
iff x Rb iff bRx. A set S is an equivalence class of R iff 3b € X, S = [b]r. 

2. Fibers of a Function. Given f : X — Y and y € Y, the fiber of f over y is 
fHu}. For any a € X, [a]., = {x € X : f(a) = f(x)} = FHF (a)}, which is 
the fiber of f over f(a). 


3. Equivalence Classes of Congruence mod n. For all n > 1 and a € Z, [aln = 
{x €Z:x2=a (mod n)} = {a + kn: k € Z}. 

4. Theorem on Equivalence Classes. Given an equivalence relation R on a set X and 
a,b € X, |a]r = [b]r iff aRb iff bRa iff b € [a]p iff a € [b]. For all equivalence 
classes S, T of R, if S A T, then SN T = Q. Equivalence classes are nonempty, 
and every a € X belongs to exactly one equivalence class of R, namely [a]. 


Exercises 
1. For the equivalence relation R3 shown in Figure 6.3, explicitly list all elements in 
[k]r forl<k <7. 
2. Let X = {1,2,3,4,5} and R = Ix U {(1,3), (3, 1), (3,5), (5,3), (1,5), (5, 1)}. Find 
all equivalence classes of R. 
3. Let X = {1,2,3,4,5,6}. For each function f : X — {a,b,c,d,e}, find all equiva- 
lence classes of ~f. 


(a) f(z) =c for alla € X. 
(b) f(x) =a for odd x € X, f(x) =e for even x € X. 
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c) f(1) = 6, f(2) =e, f(3) =a, f(4) =b, f(S) =e, f(6) =a. 

d) f(1) =a, f(2) =b, f(3) =c, f(4) =d, f(5) =e, f(6) =b. 

For each function f : R — R, describe the requested equivalence classes of ~p. 
a) f(x) = zf; find [—3], [2], and [0]. 


b) f(x) = sin z; find [0], [7/2], and [47/3]. 
c) f(x) = 3x — 7; find [2], [5], and [—3]. 
d) f(x) = 1 if x is rational, and 0 otherwise; find [3], [9/4], [z], and [e]. 


e) f(a) = |x| if x is an integer, and 0 otherwise; find [4], [4.5], and [0]. 


For each set X, equivalence relation R, and object b € X, describe the equivalence 
class [b]r as explicitly as possible. Informally explain each answer. 

a) X = {1,2,3,4,5,6}, R=Iy, b=3. 

b) X = {1,2,3,4,5,6}, R= X x X,b=3. 

c) X = {1,2,3,4,5, 6}, R = ({1,3,5} x {1,3,5}) U ({2, 4, 6} x {2,4,6}), b= 3. 
d) X = Z, R is congruence mod 2, b = 7. 

e) X =Z, R is congruence mod 7, b = 7. 


For x € R, let [a] be the least integer > x. For instance, [5] = 5, [7] = 4, and 
[—1/2] = 0. For x,y € R, define (x,y) € R iff [x] = [y]. (a) State why R is an 
equivalence relation on R. (b) Describe the equivalence class [7]. (c) Find three 
different real numbers x,y,z such that [t]r = [z]r = [y]r = [z]r. (d) Draw a 
picture of the real line showing all the distinct equivalence classes of R. 


For each equivalence relation R on X = R?, draw the equivalence classes [(1, 2)] g, 
(3,4)] Rr, [(—4, —3)] er, I(l, D] ez, and [(0,0)]e. Are any of these classes equal? 

) (a, b)R(c,d) means a = c. 

R(c,d) means b = d. 

R(c,d) means a — b = c — d. 

) (a,b) R(c, d) means a? + b? = c? + d?. 

(e) (a, b)R(c, d) means (c,d) = (ra, rb) for some r € Ryo. 

Define an equivalence relation ~ on the set of nonempty subsets of {1,2,3,4} 
by letting A ~ B mean that A and B have the same least element. Find all 
equivalence classes of ~. 

Describe an equivalence relation on R? whose equivalence classes are squares 
centered at the origin. 

Describe an equivalence relation R on Z such that for each k € Zso, R has an 
equivalence class of size k. 

Find all positive integers n such that [—13]n = [37]n. 

Find all equivalence relations R on X = {1,2,3,4,5} such that [l]r = [4]r and 
2]r = [B]r. 

Prove: for all sets X, all equivalence relations R on X, and all b € X, [blr = 
RU{O}]. 

Prove: for all sets X and all equivalence relations R on X, R = U,ex [alr x (alr. 
Prove: for all sets X and all equivalence relations R,S on X, R C S iff for all 
a € X, [alr C [als. 

Prove: for all sets X, all equivalence relations R, S on X, and all a € X, [alrns = 
ale N [a]s. 
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6.4 Set Partitions 


This section introduces set partitions, which are ways of dividing a given set into collections 
of non-overlapping subsets. For example, the set of equivalence classes of a given equiva- 
lence relation on a set X is always a set partition of X. It turns out that this set partition 
contains enough information to reconstruct the equivalence relation that produced it. This 
observation leads us to the main theorem of this section, which states (intuitively) that 
equivalence relations and set partitions are two different ways of capturing the same under- 
lying concept. More formally, for any set X, there is a natural one-to-one correspondence 
(bijection) between the set of all equivalence relations on X and the set of all set partitions 
of X. 


Definition and Examples of Set Partitions 


To motivate the introduction of set partitions, consider Figure 6.7. The top row shows the 
digraphs of three equivalence relations on the set X = {1,2,3,4,5,6}. The digraphs are 
rather cluttered because of all the arrows joining pairs of objects in the same equivalence 
class. The second row of the figure reduces the clutter by omitting all the arrows and circling 
elements that belong to the same equivalence class. Intuitively, the new diagrams contain 
exactly the same information as the old diagrams, since we can reconstruct the arrows by 
drawing all possible arrows between two objects in the same little oval. 


2 P; 


FIGURE 6.7 
Digraphs for equivalence relations and the associated set partitions. 


We can mathematically model each picture in the second row of Figure 6.7 as a set of 
subsets of X. For example, the first picture is encoded by P) = {{2,3}, {1,4}, {6,5}}. 
The next two pictures correspond to the sets P> = {{1,2},{3},{4,5,6}} and P; = 
{{2}, {1, 3, 4, 6}, {5}}. Note that the order in which we list the subsets, as well as the order 
in which the members of each subset are listed, does not matter. For instance, we could 
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also write P> = {{3}, {5, 6, 4}, {2, 1}}. Also note that the subsets in each P; do not overlap 
each other, and every element of X belongs to one of the subsets. Each P; is an example of 
a set partition, which we formally define as follows. 


6.52. Definition: Set Partitions. For all sets X and P, |P is a set partition of X | iff: 
(a) VSEP,SAMand SCX; 

(b) Va € X,4S € P,x € S; and 

(ce) YS,T E P,S 4T =>SAT=0. 

Each set S € P is called a block of the set partition P. 


Condition (a) says that a set partition is a set of nonempty subsets of X. Condition (b) 
says that every element of X belongs to some subset in the set partition. Condition (c) says 
that any two blocks of the set partition are either equal or disjoint. When proving part (c), 
we often check the contrapositive: for all S,T € P, if SAT #9, then S = T. Conditions 
(a) and (b) together show that X = Usep S, so that the whole set X is the union of all the 
blocks of the set partition. Conditions (b) and (c) are equivalent to the requirement that 
for all x € X, there exists a unique block S € P with x € S (see Exercise 9). 


6.53. Example. Which of the following are set partitions of X = {1, 2,3,4,5}? 


P, = {{1,3, 4}, {2, 5}}; Py = {1,2,3,4,5}; P; = {{1,2,3,4,5}}; 
P, {{1, 5}, {2,4}}; P; = {{1, 3, 5}, {2, 4}, Ø}; Pe = {{1, 2}, {3, 4, 5}, {2, 1}}; 
P; = {{1, 2,3}, {3,4,5}; P; = {{1}, {2}, {3}, {4}, {5} }5 Py = {{1, 2, 3}, {4, 5, 6}}. 


Solution. P, is a set partition of X with blocks {1,3,4} and {2,5}. P> is not a set partition 
of X because the elements of Pz are members of X, rather than subsets of X. P} is a set 
partition of X whose only block is the full set X = {1,2,3,4,5}. At the other extreme, Ps 
is a set partition with five blocks each containing a single element. P, is not a set partition 
of X because 3 € X does not belong to any block of P4; note Py, is a set partition of the 
smaller set {1,2,4,5}. Ps is not a set partition because one of its members is the empty set, 
and all blocks are required to be nonempty. Ps is a set partition with two distinct blocks 
{1,2} and {3, 4,5}; it does not matter that the block {1,2} was listed twice. Py is not a set 
partition because the two unequal subsets {1, 2,3} and {3, 4,5} have nonempty intersection. 
Finally, Py is not a set partition of X since one of the members of P), namely {4,5,6}, is 
not a subset of X. However, P is a set partition of the larger set {1, 2,3, 4,5, 6}. 


II 


The Set Partition Induced by an Equivalence Relation 


The major point of this section is that every equivalence relation on X determines a set 
partition of X, and vice versa. Intuitively, given an equivalence relation R on X, we obtain 
the associated set partition P by forming the set of all equivalence classes of R. The next 
definition gives a name to this set partition. 


6.54. Definition: The Quotient Set X/R. Given a set X and an equivalence relation 
R on X, the quotient set X/R is the set of all equivalence classes of R. Formally, X/R = 
{[la]r : a € X}. So for all sets S, | S € X/R| iff | Ja € X, S = |a]r |. X/R is also called the 
set partition induced by R. 


The fact that X/R really is a set partition of X follows from the Theorem on Equivalence 
Classes, parts (b) and (c). 
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6.55. Example. Let X = {1,2,3,4,5,6}. For any equivalence relation R on X, we always 
have X/R = {[1]r, [2], [3] ex; [4], [5]; [6]r}. But this presentation is misleading because a 
given equivalence cles may be listed more than once (compare to the set partition Pg in 
Example 6.53). We get a better understanding of the quotient set by choosing a single name 
for each distinct equivalence class. For example, consider the equivalence relations R1, R2, 
and Rz from Figure 6.7. We find that 


X/Rı = = 114, A}, {2, 3}, {5, 6} = = {[1 Jr,» [2 Jr» [5 Jz} = {[4 Jri; [3 Jra; [6]r, }; 


X/Rz = {{1, 2}, {3}, {4, 5, 6}} = {[2] rz; [B] r; [5]r:}; 
X/R; = = {{1, 3,4, 6}, {2}, {5}} = {[1]rs; [2 JRs, [5] x5 }- 


We have X/Ix = {{1}, {2}, {3}, {4}, {5}, {6}. For R = X x X, X/R = {{1,2,3,4,5,6}} = 
{[k]r} for any k € X. For the function f from Example 6.44, 


X/~s= {{1,2, 3}, {4, 6}, {5H = Hve le, Bln} = RS FL A 
6.56. Example. For f : R > R given by f(x) = |z|, 


R/~s= {[z]~, : x E R} = {{z, —xz} : x € Ryo}. 


For the sign function, R/~sen= {R<o, {0}, Ryo}. For g : R? > R given by g(x,y) = 27+ y”, 
R?/~, is the set of all circles centered at the origin (including the degenerate circle consisting 
of just the origin). For the relation S on R? from Example 6.48, R?/S is the set of all rays 
starting at the origin, together with {0}. 


6.57. Example: Integers Modulo n. For fixed n > 1, the quotient set Z/=,, is called the 
set of integers modulo n. We often write Z/n instead of Z/=,. By definition, Z/n = {[a]» : 
a € Z}. However, this presentation of the elements of Z/n contains a lot of redundancy. For 
instance, consider the case n = 5. By our calculation of equivalence classes in the previous 
section, we know that [0]5 = {5k+0 : k € Z}, [1]s = {5k+1 : k € Z}, [2]; = {5k+2 : k € Z}, 
[3]s = {5k +3 : k € Z}, and [4]; = {5k +4 : k € Z}. These five subsets of Z already exhaust 
the entire set Z; see Figure 6.8. Note, for example, that [5]; = [0]5 since 5 € [0]5; similarly 
(10]5 = [15]5 = [—5]5 = [0]s, [7]5 = [12]5 = [17|5 = [-3]s = [2]s, and so on. To summarize, 
Z/5 is the five-element set Z/5 = {[0]s5, [1]s, [2], [3]s, [4]5}, where each element of Z/5 is an 
infinite subset of Z. In §6.6, we prove the general fact that Z/n consists of the n distinct 
equivalence classes [0]n, [IJn,---, [n — In. 


The Equivalence Relation Induced by a Set Partition 


So far, we have seen how an equivalence relation R on a set X generates an associated set 
partition X/R. Next, let us start with an arbitrary set partition P on a set X and see how 
to use P to build an equivalence relation ~p on X. The key idea is that two objects in X 
are related under ~p iff those objects belong to the same block of P; compare to Figure 6.7. 


6.58. Definition: Equivalence Relation Induced by a Set Partition. Given a set 


partition P of a set X, define a relation ~p on X as follows: for all a,b € X, |a ~p b| iff 


JS €e P,a € S and b€ S|. We call ~p the equivalence relation on X induced by P. 


We check in a moment that ~p really is an equivalence relation on X. First, however, 
let us consider some examples. 
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A S 
710] |-9| |-8 | |-7 | -6é 
-5/|-4) |-3 | -2 | -1 

0 1 2 3 4 

5 6 7 8 9 
10) /11)|12 | |13 | 14 
15|/|16/||17 | |18 | 19 
20) | 21) /22 | |23 | 24 


[0] [1] [2] [3] [4] 
Z/5 


FIGURE 6.8 
The quotient set Z/5, consisting of five equivalence classes. 


6.59. Example. Suppose X = {1, 2,3,4,5} and consider the set partitions P,, P3, Ps, and 
Ps from Example 6.53. Applying the definition to P, = {{1,3, 4}, {2,5}}, we find that 


~p, = {(1,1), (3,3), (4,4), (1,3), (3, 1), (1,4), (4, 1), (3, 4), (4, 3), (2, 2), (5, 5), (2, 5), (5, 2). 


Since every x € X belongs to the same block of P3, ~p,= { (a,b) :a,b E€ X} = X x X. At 
the other extreme, ~p,= {(a,a) : a E€ X} = Ix. We can write ~p, as ({1,2} x {1,2}) U 
({3, 4,5} x {3,4,5}). If we try this construction on Py = {{1,5}, {2,4}} (which is not a set 
partition of X), we get a relation 


~p, = {(1,1), (5,5), (1,5), (5, 1), (2, 2), (4, 4), (2, 4), (4, 2)}, 


which is not reflexive on X since 3 %p, 3. However, ~p, is an equivalence relation on the 
smaller set {1,2,4,5}. Similarly, ~p, is not an equivalence relation on X since it is not a 
relation on X, but it is an equivalence relation on the larger set {1,2, 3,4,5,6}. Finally, 
if we apply the definition above to Py (which is not a set partition of any set), we get a 
non-transitive relation ~p,, since 1 ~p, 3 and 3 ~p, 5 but 1 %p, 5. 


6.60. Lemma. For any set partition P of any set X, ~p is an equivalence relation on X. 


Proof. Fix a set partition P of a set X. By definition, ~p is a relation on X; we check ~p 
is reflexive on X, symmetric, and transitive. Fix a,b,c E€ X. By (b) in the definition of set 
partition, there exists S € P such that a € S. So “JS € P,a E€ S ^a € S” is true, which 
means that a ~p a. Thus, ~p is reflexive on X. To prove symmetry, assume a ~p b and 
prove b ~p a. We have assumed 4S € P,a € S ^b E€ S. By commutativity of AND, we see 
that 3S € P,b € S^a E S, sob ~p a holds. To prove transitivity, assume a ~p b and 
b ~p c; prove a ~p c. We have assumed there exist blocks S,T € P witha € S, b € S, 
b €T, and c € T. The critical observation is that b € S N T, so SMT is nonempty. By (c) 
in the definition of set partition, S = T follows. So now a € S and c € S$ with S € P, which 
means a ~p c. 


6.61. Example. Let X =R and P = {[n,n+ 1) : n € Z}. By drawing a picture of the real 
number line, it is intuitively evident that P is a set partition of X (see Exercise 11 in §8.5 
for a formal proof). The associated equivalence relation ~p can be described as follows: for 
x,y E€ R, x ~p y iff we obtain the same answer when we round x and y down to the next 
closest integer. Letting f(x) = |x] denote x rounded down to the next integer, we see that 
~p is the equivalence relation ~f induced by the floor function f : R > Z. 
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The Bijection Between Equivalence Relations and Set Partitions 


Fix a set X. We have now discussed a construction that maps any equivalence relation R 
on X to an associated set partition X/R on X, along with another construction that maps 
any set partition P on X to an associated equivalence relation ~p on X. We now prove the 
key point that these constructions are inverses of each other. Intuitively, this means that 
equivalence relations and set partitions are really just two different formalizations of the 
same intuitive concept of dividing the objects in X into non-overlapping clumps, where all 
objects in a given clump share some common property. 


6.62. Theorem on Equivalence Relations and Set Partitions. Fix a set X, let EQ 
be the set of all equivalence relations on X, and let SP be the set of all set partitions of 
X. The map f : EQ > SP given by f(R) = X/R for R € EQ is a bijection with inverse 
g : SP — EQ given by g(P) = ~p for P € SP. 


Proof. [This proof may be considered optional.] We have already seen that f maps EQ 
into the codomain SP (by the Theorem on Equivalence Classes), and g maps SP into the 
codomain EQ (by Lemma 6.60). It suffices to check that go f = Ideg and fog = Idgp. For 
go f, fix an equivalence relation R € EQ, let P = f(R) = X/R, and let R’ = g(f(R)) = 
g(P) = ~p. We must prove R = R’. To do so, we fix a,b € X, and we prove aRb iff aR’b. 
First assume aRb, and prove aR’b. Since aRb and bRb, we know that a € [b]r and b € [b]pr. 
Now, [b]z is one of the blocks in the set partition P. Since a and b both belong to this block 
of P, we have a ~p b by definition of ~p, and hence aR’b. Conversely, assume aR’b, and 
prove aRb. We have assumed a ~p b, which means 4S € P,a € SAb ES. Now, every block 
S in P = X/R is an equivalence class of R. So S must have the form [c]r for some c € X. 
Since a,b € [c]r, we know afc and bRc. Using symmetry and transitivity of R, we conclude 
aRb. 

Next we check f o g is the identity function on SP. Fix a set partition P € SP, let 
R = g(P) = ~pE EQ, and let P’ = f(g(P)) = f(R) = X/R. We must prove P = P’. First, 
fix S, assume S € P, and prove S € P’. Since P is a set partition of X, there exists a € X 
with a € S. The equivalence class [a]r is one of the blocks in P’, so it suffices to show 
S = [a]r. On one hand, fix b € S, and prove b € [a]z. Since b and a both belong to the 
block S of P, we have b ~p a, hence bRa, hence b € [a]r. On the other hand, fix d € [a]p, 
and prove d € S. Here dRa, sod ~p a, so JIT € P,d e T ^a €T. Now, S is the unique 
block of P containing a, so we must have T = S. Then d € S, as needed. 

Conversely, fix a new S, assume S € P’, and prove S € P. Here we know that S$ is an 
equivalence class of ~p, say S = [a]~, for some a € X. Let T be the unique block in P 
containing a; it suffices to show S = T. On one hand, fix b € S, and prove b € T. Here 
b € [a].,, so b ~p a, so there exists a block U of P containing b and a. By uniqueness, the 
only block containing a is T; so U = T, and b € T as needed. On the other hand, fix c € T, 
and prove c € S. Here c and a are in the same block T of the set partition P, so c ~p a, so 
cE [a]., = S, as needed. 


Section Summary 


1. Set Partitions. For all sets X and P, P is a set partition of X iff every S € Pisa 
nonempty subset of X; every x E€ X belongs to some S € P; and for all S,T € P, 
if SAT, then SNT =. Given a set partition P of X, every a € X belongs to 
a unique block of P. 


2. Quotient Sets. Given an equivalence relation R on a set X, the quotient set X/R 
is the set of all equivalence classes of R, namely X/R = {|a]r : a € X}. So 
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S € X/R iff da € X, S = [alr. X/R is always a set partition of X, called the set 
partition induced by the equivalence relation R. 


3. Equivalence Relation Induced by a Set Partition. Given a set partition P of a set 
X, the relation ~p, defined by a ~p b if IS € P,a E€ SADE S (where a,b € X), 
is an equivalence relation on X. Informally, this equivalence relation relates each 
pair of objects that belong to the same block of P. 


4. Correspondence between Equivalence Relations and Set Partitions. For any set 
X, there is a bijection from the set of equivalence relations on X to the set of set 
partitions of X. This bijection sends an equivalence relation R to the quotient 
set X/R. The inverse bijection sends a set partition P to the equivalence relation 
~p. Figure 6.7 gives visual intuition for how this correspondence works. 


Exercises 


1. Which of the following are set partitions of X = {v, w, x,y,z}? Explain. 

a) {{u, x, z}, {w, y}} 

b) {{v}, tw, 2}, {x,y}, tz, wh, {uf} 

c) {{v, w}, {w, 2}, {y, 2h} 

d) {{v, x,y}, {2} 

e) {v, w, £, Y, z} 

f) {{v, w, x,y, z}} 

g) {{{v, w, 2, y, z}}} 

2. Which of the following are set partitions of R? Explain. 

a) {Z, R — Z} (b) {R} (c) {{a} : a € R} (d) {(n—1,n): ne Z} 

e) {(n— 1,n]:n € Z} (£) {[n— 1,n]:n € Z} (g) {(—n,n): n € Z}. 

3. Which of the following are set partitions of some set X? For those that 
are, say what X is. (a) {1,2,{3,4}} (b) {{1,2},{3,4}} (c) {{{1, 2}}, {3, 4}} 
(d) {{1, 3}, {4,6}, {2, 8}} (e) {0, {1}, {2, 3}} €) {{O}, {1 {{1,2)} 

4. (a) List all possible set partitions of X = {1, 2,3, 4}. 
(b) Use (a) to count the number of equivalence relations on X. 


5. For each set X and equivalence relation R on X, explicitly describe the associated 
set partition X/R. 
(a) X = {a,b,c,d,e}, R = Ix U {(a, 0), (c,a), (ce), (e, c), (a, e), (e, a)}. 


e) X = P({1,2,3}), (S,T) € Riff S and T have the same size. 
f) X =R?, (a,b) R(c,d) means b — a? = d — œ. 


6. For each function f in Exercise 2 of §6.2, explicitly describe the set partition 
associated to the equivalence relation ~ş. 


( 

(c) 

(d) X =R, «Ry means y— z € Z. 
(e) 

( 


7. For each function f : X — Y, explicitly describe the set partition associated to 
the equivalence relation ~p. 
(a) f : [-1,1] > R given by f(x) = v1 — x? for x € |-1,1]. 

b) f: Z> Z given by f(n) = —7 for all n € Z. 

c) f : Z>9 > Zso given by f(0) = 0 and f(n) =n -— 1 for all n > 0. 

d) f : Z > Z given by f(n) = n/2 for n even, and f(n) = (n — 1)/2 for n odd. 


( 
(e) f :R > R given by f(x) = x — |z] for x € R. 
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8. For each set partition P of each set X, describe the induced equivalence relation 


(a) X = {a,b,c,d}, P = {{a, c}, {b, d}}. 
(b) X = {1,2,3,4,5,6}, P = {{1, 3,6}, {2,5}, {4}. 
(c) X= Z, P= {Z<o0,Zs3, {1, 2, 3}}. 
(d) X =R?, P = {{a}xR:a €R}. 
(e) X =R?°, P = {{(a,a+c):a€R}: ce R} 

9. Let P be a collection of nonempty subsets of a set X. Prove: P is a set partition 
of X iff Vx € X,AISEPawes. 

10. Let A, B,C, D be pairwise disjoint nonempty sets. 
(a) Show that R = (A x A)U (B x B) u (C x C) U(D x D) is an equivalence 
relation on X = AUBU CUD. 
(b) What is the set partition X/R? 


11. (a) Prove: for all sets X, Y, P, Q, if P is a set partition of X and Q is a set 
partition of Y and X NY = f, then PU Q is a set partition of X UY. (b) Show 
that (a) can be false without the hypothesis X NY = 0. 


12. Prove or disprove: for all sets X, Y, P, Q, if P is a set partition of X and Q isa 
set partition of Y, then P/M Q is a set partition of XN Y. 


13. Give an example of a set X and set partitions P, Q of X such that P is a four- 
element set, Q is an eight-element set, and every member of Q is a subset of some 
member of P. Is Q C P here? 


14. Prove or disprove: for all sets X, P, Q, if P and Q are set partitions of X and 
PC Q, then P= Q. 


15. Let R be an equivalence relation on a set X, and define p : X —> X/R by 
p(a) = [a]r for all a € X. (a) Prove p is surjective. (b) Prove that R equals ~p. 
[This means that every equivalence relation has the form ~p for an appropriate 
choice of f.] (c) Prove p is injective iff R = Ix. 


Equivalence Relations and Partial Orders 301 


6.5 Partially Ordered Sets 


Recall that the abstract idea of an equivalence relation came from the concrete concept of 
equality by isolating three key properties of equality: reflexivity, symmetry, and transitivity. 
In this section, we formulate the abstract concept of ordering by singling out appropriate 
properties of the concrete ordering relation < on real numbers. This relation has the fol- 
lowing properties: for all x,y,z € R, (1) x < x (reflexivity); (2) if z < y and y < z, then 
x < z (transitivity); (3) if x < y and y < zx, then x = y (antisymmetry); and (4) x < y or 
y < «x (comparability). It turns out that many mathematical relations, like subset inclusion 
and divisibility, satisfy the first three of these properties. Thus we use these properties to 
define partial order relations. Partial orderings that also possess property (4) are called total 
orders. We also discuss well-ordered sets, in which every nonempty subset of the set has a 
least element. 


Antisymmetric Relations and Partial Orders 


For all real numbers x and y, if x < y and y < x, then x = y. The generalization of this 
property to arbitrary relations is given in the next definition. 


6.63. Definition: Antisymmetric Relations. For all relations R: 


R is antisymmetric | iff | Va, b, if (a,b) € R and (b,a) € R, then a = b. 


Using infix notation, R is antisymmetric iff for all a,b, if aRb and bRa, then a = b. 


In terms of the digraph for R, antisymmetry means that for any two distinct vertices 
a and b, the arrows from a to b and from b to a cannot both appear in the digraph. In 
Figure 6.1 of §6.1, every relation shown is antisymmetric except for R4. Relation R; is both 
symmetric and antisymmetric. 


6.64. Definition: Partial Orders and Posets. For all sets X and all relations R on X: 


R is a partial order on X | iff] R is reflexive on X, antisymmetric, and transitive |. 


The ordered pair (X, R) is called a partially ordered set or poset. 


We often use the < symbol, or related symbols such as <, to denote a partial order 
relation. 


6.65. Example: Subset Ordering Relation. Let X be any collection of sets. We can 
define a relation R on X as follows: for all A, B € X, (A, B) € Riff A C B. We know that 
for all A,B,C € X, AC A (reflexivity); if A C B and B C A, then A = B (antisymmetry); 
and if AC B and B CC, then A C C (transitivity). Thus, R is a partial ordering relation 
on X, and (X, C) is a partially ordered set. 


6.66. Example: Divisibility Ordering Relation. Let X be any set of positive integers. 
Define a relation R on X as follows: for all a,b € X, (a,b) € R iff a divides b. The notation 
a|b means that a divides b. We know that for all a,b,c € Zso, ala (reflexivity); if aļb and 
bla, then a = b (antisymmetry); and if a|b and b|c, then alc (transitivity). Thus, (X,]|) is a 
partially ordered set. In contrast, (Z,|) is not a poset, because antisymmetry fails: 2|(—2), 
and (—2)|2, yet 2 4 —2. To see why antisymmetry works in Z>o, fix positive integers a, b, 
and assume a|b and bla. Because a and b are positive, we can deduce a < b and b < a (see 
Example 2.36). Then a = b because the standard ordering on integers is antisymmetric. 
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When X is finite, we can visualize a poset (X, <) using a picture called a Haase diagram. 
This picture is obtained from the digraph of the relation < by omitting all arrows that can 
be deduced from reflexivity or transitivity. It is customary to position the elements so 
that each relation arrow goes from a lower element to a higher element; this is possible by 
antisymmetry. For example, Figure 6.9 shows the Haase diagrams of four posets: the set 
P ({1, 2, 3}) ordered by set inclusion, the set of positive divisors of 30 ordered by divisibility, 
the set of positive divisors of 12 ordered by divisibility, and the set of positive divisors of 
12 ordered by the standard ordering of integers. 


30 
6 10 N, 
2 3 5 
1 


FIGURE 6.9 
Haase diagrams for four posets. 


The last two examples show that there can be more than one partial order relation on 
a given set. It also frequently happens in a poset (X, <) that there are incomparable pairs 
of elements a,b where neither a < b nor b < a is true. For instance, in the poset (Z>o, |), 
3/4 and 4|3 are both false. In contrast, for any two real numbers x and y, x Sy ory <x 
must be true. Generalizing this latter property, we arrive at the definition of total ordering 
relations. 


6.67. Definition: Total Orders. A relation < on a set X is a | total order on X |iff < isa 
partial order on X such that |Va,b E€ X, a < b or b < a. | The pair (X, <) is called a totally 
ordered set. 


We have seen above that (R,<) is a totally ordered set, but (Zso,|) is not totally 
ordered. Similarly, a poset (X,C) is not totally ordered if we can find subsets A,B € X 
such that A Z B and B Z A. For example, if X = P({1,2,3,4}), we may take A = 
{1,2} and B = {2,3} to see that (X,C) is not totally ordered. On the other hand, if 
S = {0, {1}, {1,3}, (1, 3, 4}, (1, 2,3, 43}, then (S, C) is a totally ordered set. 


Constructing New Posets 


We now give more examples of posets by introducing methods of constructing new posets 
from old ones. The first construction lets us turn a subset of a poset into a partially ordered 
set. 


6.68. Definition: Subposets. Given a poset (X, <) and a subset S of X, define a relation 
<’ on S as follows: for all a,b € S, let a <’ b iff a < b. The relation <’ is called the restriction 
of < to S, and (S, <’) is called a subposet of (X, <). 
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The poset axioms for (S, <’) follow immediately from the poset axioms for (X, <). For 
example, to check antisymmetry of <’, suppose a,b € S satisfy a <’ b and b <’ a. Then 
a<bandb<a,soa=b because < is antisymmetric. Every subposet of a totally ordered 
set is also totally ordered. On the other hand, it is quite possible for (S,<’) to be totally 
ordered even when (X,<) is not totally ordered. We saw an example of this above when 
(X, <) is (P({1, 2, 3, 4}), C). For another example, taking S = {2 : k € Zso} gives a totally 
ordered subposet of the divisibility poset (Z>o, |). In general, given a poset (X,<) and a 
subset S of X, S is called a chain in X iff the subposet (S, <’) is totally ordered. 

We can concatenate a list of non-overlapping posets to obtain a new poset, as follows. 


6.69. Definition: Concatenation Posets. Let (X1, <1), (X2, <2), ..., (Xn, <n) be a list 
of posets such that the sets X1, X2,..., Xn are pairwise disjoint (i.e., X; O X; = Ø for all 
i Æ j). Define a relation < on X = X1 U X2 U --- U Xn as follows: for a,b € X with a € X; 
and b € Xj, let a < b iff i < j, or i = j and a <; b. The pair (X, <) is a poset called the 
concatenation of X1,..., Xn- 


Let us check that < is transitive. Fix a,b,c € X, assume a < b and b < c, and prove 
a < c. We know there are unique i,j,k € {1,2,...,n} with a € X;, b € Xj, and c € Xx. 
Consider various cases. Case 1. Assume i < j < k. Then i < k (by transitivity of < on Z), 
so a < c. Case 2. Assume i < j = k. Here i < k, so a < c. Case 3. Assume i = j < k. Here 
i < k, so a < c. Case 4. Assume i = j = k. Here we know a <; b and b <; c, so a <; c by 
transitivity of <;. Thus, a < c follows by definition of <. You can check that < is reflexive 
on X and antisymmetric. Moreover, if every (X;, <;) is totally ordered, then (X, <) is also 
totally ordered. 

Our next construction gives a way to turn the product of posets into a poset. 


6.70. Definition: Product Posets. Given posets (X1, <1), (X2, <2), ..., (Xn, <n), define 
a relation < on X = Xı x X2x- - -xX Xn as follows: for all a = (a1,...,@n) and b = (b1,..., bn) 
in X, define a < b iff a; < b; for all i € {1,2,...,n}. The pair (X, <) is called the product 
poset with factors (X;,<;). 


We check that (X, <) is transitive, letting the reader verify reflexivity and antisymmetry. 


Fix a,b,c € X, and write a = (aj,...,d,), b = (b1,...,bn), € = (C1,---,€n) for some 
a;,b;,c; E€ X;. Assume a < b and b < c, which means a; < b; and b; < c; for all i € 
{1,2,...,n}. Since each relation <; is transitive, we deduce a; < c; for all i, so a < c holds. 


The product of totally ordered posets is usually not totally ordered. For example, let 
X = {1,2,3} and Y = {1,2,3,4} considered as subposets of (Z, <). In the product poset 
(X x Y, <), we have (1,4) A (2,3) and (2,3) Z (1,4), so this poset is not totally ordered. 
In the exercises, we describe another ordering on product sets called lexicographic ordering, 
which is a total ordering if all the factors are totally ordered. 


Greatest and Least Elements 


Given a subset S of a partially ordered set, it is often useful to single out the largest and 
smallest elements in S (when such elements exist). 


6.71. Definition: Greatest and Least Elements. Let (X,<) be a poset and SC X. 


For all z, | z is the greatest element of S |iff |z € S and Vy E€ S,y < z. 


For all x, | x is the least element of S | iff |x € S and Vy € S, x < y. 


We sometimes write | z = max S'] to mean that z is the greatest (maximum) element of S, 


and | z = min S| to mean that x is the least (minimum) element of S. 
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The next examples show that not all subsets have greatest and least elements. But when 
these elements exist, they are unique. For suppose zı and z2 are both greatest elements of 
S. Since zı is a greatest element and z2 E€ S, z2 < z1. Since z2 is also a greatest element 
and 21 E€ S, z1 < z2. Then 21 = z2 follows from antisymmetry. Similarly, least elements are 
unique when they exist. 


6.72. Example. In the poset (Z,<), the subset Zs has least element 1 but no greatest 
element. The subset Z<s5 has greatest element 5 and no least element. The subset Ø has 
no least element and no greatest element. Similarly, Z itself has no least element and no 
greatest element. Let us prove carefully that the set Zoqa of all odd integers has no greatest 
element. Assume, to get a contradiction, that z € Zoaq is the greatest element of this subset. 
Then z+ 2 is also odd, and z+2 < z is not true. This contradiction proves that no greatest 
element exists. 


6.73. Example. In the poset (R,<), the half-open interval (0,1] = {£r ER:0< x< 1} 
has greatest element 1 but no least element. Note that 0 is not the least element of (0, 1] 
because it does not belong to this subset. For any xo € (0,1], 2 cannot be the smallest 
element of (0, 1] since 19/2 is in this subset and r9/2 < xo. Similarly, for all real a < b, we 
see that [a,b) has least element a but no greatest element, and (a,b) has no least element 
and no greatest element. 


6.74. Example. For any set X, the poset (P(X), C) consists of all subsets of X ordered 
by set inclusion. This poset has least element Ø and greatest element X. However, you 
can show that the subset S of P(X) consisting of all nonempty proper subsets of X has 
no least element and no greatest element. For example, consider X = {1,2,3}, so S = 
{{1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}}. No one of these six sets is a subset of all other members 
of S, so S has no least element. Similarly, no one of these six sets contains all the other sets 
as subsets, so S has no greatest element. 


6.75. Example. Let X be the set of positive divisors of 36, ordered by divisibility. X 
has least element 1 and greatest element 36. Removing these two elements, we get the 
subset S = {2,3,4,6,9, 12,18} ordered by divisibility. You may check that S has no least 
element and no greatest element with respect to this partial order. Of course, with respect 
to the standard ordering <, S has least element 2 and greatest element 18. On the other 
hand, now take X to be the set of positive divisors of 32, ordered by divisibility. Here 
X = {1,2,4,8,16,32} is totally ordered by |. You may verify that every nonempty subset 
of X has a least element and a greatest element with respect to |. 


6.76. Example. Suppose (X1, <1),...,(Xn,<n) are nonempty posets. The concatenation 
poset (X,<) has a least element iff (X1, <1) has a least element, and these elements are 
the same. Similarly, the greatest element of (X,<) is the greatest element of (Xn, <n) if 
the latter exists. On the other hand, a = (aj,...,@,) is the least element of the product 
poset (X1 x +- X Xn, <) iff a; is the least element of (X;,<;) for all i; likewise for greatest 
elements. 


Well-Ordered Posets 


In the examples above, we have seen many posets (X,<) with no least element. We have 
also seen that X itself may have a least element, but there could exist nonempty subsets 
S of X where S has no least element. For example, this happened when X = R and £ is 
an open interval (a,b). We now give a special name to partially ordered sets where every 
subset (other than Ø) has a least element. 
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6.77. Definition: Well-Ordered Posets. A poset (X,<) is | well-ordered | iff 
every nonempty subset of X has a least element. 


A well-ordered poset must be totally ordered. For if (X, <) is well-ordered and a,b € X, 
then S = {a,b} is a nonempty subset of X. So this subset has a least element. If a is the 
least element of S, then a < b. If b is the least element of S, then b < a. Thus, a < b or 
b <a holds. 

It is intuitively plausible that every totally ordered finite set is well-ordered. We delay 
the formal proof until §7.1, where finite sets are officially defined. On the other hand, (Z, <) 
and (R, <) are not well-ordered sets, since they have no least element. Although (Ro, <) 
has a least element (zero), this poset is not well-ordered either, since its subset Rs» has no 
least element. Intuitively, (Z>0, <) and (Zo, <) are well-ordered sets, although the proof 
of this fact is not obvious. In some formal developments of the integers, the well-ordering 
of Zs is postulated as an axiom. 


6.78. Well-Ordering Property of Zo. The poset (Zs, <) is well-ordered. 


We now show that the Well-Ordering Property is logically equivalent to the Induction 
Axiom from §4.1. [This proof may be considered optional.] First, we assume the Induction 
Axiom holds and prove the Well-Ordering Property. Fix a subset S of Z>o that has no 
least element; we must prove that S = Ø. To do so, we use strong induction to prove the 
statement Vn € Zyo,n Æ S. Fix no € Zyo. By strong induction, we may assume that for all 
integers m in the range 0 < m < no, m ¢ S. Using this, we must prove that no ¢ S. Assume, 
to get a contradiction, that no € S. We derive a contradiction to our original assumption 
on S by showing that no is the least element of S. Fix k € S, so k is a positive integer. We 
cannot have k < no, since otherwise the induction hypothesis would give k ¢ S. Because 
(Z>o, <) is totally ordered, we conclude that no < k. We have now shown that ng is the 
least element of S. This contradiction forces no ¢ S, completing the proof by induction. 

Next, we assume the Well-Ordering Property holds and prove the Induction Axiom. Fix 
an open sentence P(n). As discussed in §4.1, it is enough to prove that the statement Q: 
“P(1)AVn E Zso, (P(n) > P(n+1))” implies the statement R: “Vn € Zso, P(n).” We use 
proof by contradiction, assuming that Q is true and R is false. To use the Well-Ordering 
Property, we need a nonempty subset of the positive integers. We define this subset to be 
S = {n € Z>o :~P(n)}, the set of positive integers for which P(n) is false. A useful denial 
of R is dn € Zso,~P(n), which means that S is nonempty. The Well-Ordering Property 
states that there is a least element of S, say no. Either no = 1 or no > 1. In the case no = 1, 
we see that P(1) is false because no belongs to S, but also P(1) is true by statement Q. 
This is a contradiction. In the case ng > 1, then no — 1 is a positive integer less than no. 
On one hand, no is the least element of S, so no — 1 ¢ S and no € S. This means that 
P(no — 1) is true and P(no) is false. On the other hand, taking n = np — 1 in statement Q, 
we see that P(no— 1) => P(no) is true. This contradicts the truth table for IF, and the proof 
is complete. [The preceding proof tacitly uses certain additional facts about integers. For 
example, we needed to know that for any integer ng > 1, no — 1 is also a positive integer. 
For a careful development of the integers where all necessary properties are derived from 
axioms, see Chapter 8. In particular, compare the discussion here to Lemma 8.41 and the 
proof of Theorem 8.51.] 


Section Summary 


1. Partial Orders and Total Orders. A relation < on a set X is a partial order iff < is 
reflexive on X, transitive, and antisymmetric. This means that for all x,y,z € X, 
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x < x; if x < yand y < z, then z < z; and if x < y and y < x, then x = y. A 
total order is a partial order such that for all x,y € X, £x <y ory< rz. 


2. Examples of Posets. Any set of sets is partially ordered by set inclusion. Any set of 
positive integers is partially ordered by divisibility. Any subset of a poset is a poset 
with the restricted ordering. The concatenation of a list of pairwise disjoint posets 
is a poset. The product of posets is a poset if we let (a1,...,an) < (b1, ...,bn) 
mean a; <; b; for all i. 


3. Greatest and Least Elements. If S is any subset of a poset (X, <), z is the greatest 
element of S iff z € S and Vy € S,y < z; x is the least element of S iff x € S 
and Vy € S,x < y. Greatest and least elements do not always exist, but they are 
unique when they do exist. 


4. Well-Ordered Sets. A poset (X,<) is well-ordered iff every nonempty subset of 
X has a least element. The Well-Ordering Property states that Z>o with the 
standard ordering is well-ordered. This property is equivalent to the Induction 
Axiom. 


Exercises 


1. Which relations shown in Figure 6.3 are antisymmetric? 


2. Give a specific example of a relation R that is not symmetric and not antisym- 
metric. 


3. Find all total order relations on the set X = {1, 2,3}. 


Draw Haase diagrams for each poset. 

(a) the set of positive divisors of 6 ordered by divisibility. 

(b) the set of positive divisors of 8 ordered by divisibility. 

(c) the set of positive divisors of 100 ordered by divisibility. 
(d) the set of all subsets of {a, b, c,d} ordered by set inclusion. 


5. Let X = {1,2,3,4,5}. A Haase diagram for the poset (X,<) has arrows from 1 
to 2 to 4 to 5 and arrows from 1 to 3 to 5. (a) Draw this diagram. (b) Describe 
the relation < as a set of ordered pairs. (c) Confirm that < really is a partial 
order on X. 

6. (a) Find two different chains of maximum size in the poset (P({1, 2,3, 4}), ©). 
(b) Find all chains of maximum size in the poset of positive divisors of 75 ordered 
by divisibility. 

7. Suppose (X, <) isa poset. Define a new relation > on X as follows: for alla,b € X, 
define a > b iff b < a. Prove that > is a partial order on X, and > is a total 
ordering iff < is a total ordering. 

8. (a) Suppose X is a fixed set, and for each 7 in an index set I, R; is a partial 
order on X. Show that R = fez Ri is also a partial order on X. (b) If every 
R; is a total order, must R be a total ordering on X? Prove your answer. (c) Is 
S = U;cz Ri a partial order on X? Prove your answer. 

9. (a) Let (X,<) be the concatenation of n posets (X;, <i). Prove < satisfies reflex- 
ivity and antisymmetry. (b) Prove < is a total order on X iff every <; is a total 
order on X;. 

10. Let (X,~<) be the product of n posets (Xj, <i). Prove < satisfies reflexivity and 
antisymmetry. 
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11. Suppose (J,~<) is a partially ordered index set, and for each i € I, (Xj, <i) 
is a poset. Assume the sets X; are pairwise disjoint. Define a relation < on 
X =Uje, Xi as in Definition 6.69. Prove that (X, <) is a poset. 

12. Suppose (X;, <;) is a poset for 1 < i < n. Let X = Xı x X2x---x Xn. Define the 
lexicographic order <jex on X as follows: for a = (a1,...,@,) and b = (b1,...,bn) 
in X, let a Slex b iff a = b or there exists i € {1,2,...,n} with a; = b; for all 
j <i, a; £ bi, and a; <; bi. (a) Prove the poset axioms for <jex. (b) Prove: if 
every <; is a total ordering, then <j., is a total ordering. 

13. (a) Show that the concatenation of n well-ordered sets is well-ordered. (b) Show 
that the product of n well-ordered sets, ordered lexicographically (see the previous 
exercise), is well-ordered. 

14. Let Y = P({1,2}), Z = {3,4,5,6}, and R = Iz U {(3, 4), (3,5), (3,6)}. (a) Draw 
Haase diagrams for (Y, C) and (Z, R). (b) Draw a Haase diagram for the concate- 
nation of Y and Z (in this order). (c) Draw a Haase diagram for the concatenation 
of Z and Y (in this order). 

15. Let X = {1,2,3} and Y = {0,2,4,6}. Draw a Haase diagram for the product 
poset (X,<) x (Y,<), where < is the standard ordering on Z. 

16. Determine (with explanation) whether each poset below is well-ordered. 

(a) Z>o ordered by divisibility. 
(b) Z<a ordered by <, for fixed a € Z. 

(c) the set of positive divisors of 80 ordered by divisibility. 

(d) the set of positive divisors of 81 ordered by divisibility. 

(e) [1,5] ordered by <. 

(£) {1, 2,3,4,5} ordered by <. 

17. Use the Well-Ordering Property for Zo to prove that for each a € Z, (Z>a, <) 
is a well-ordered set. 

18. Prove that every nonempty subset of (Z<o, <) has a greatest element. 


19. Define a relation R on Z by setting, for all a,b € Z, 
aRb & ((0<a<b)v(b<a<0)V(a>0^b<0)). 
Prove that (Z, R) is a poset, and this poset is well-ordered. 

20. Carefully prove the assertions in the last sentence of Example 6.73. 

21. Prove the assertions made in Example 6.76. 

22. Show that a relation R on a set X is both symmetric and antisymmetric iff 
RC Ix. 

23. A relation < on a set X is called a strict partial order on X iff < is transitive 
and for all a € X, a < a is false. (The last condition is called irreflezivity.) 

(a) Given a poset (X, <), define a < b to mean a < b and a Æ b for a,b € X. 
Show that < is a strict partial order on X. 

(b) Conversely, given a strict partial order < on a set X, define a < b to mean 
a < bor a =b for a,b € X. Show that < is a partial order relation on X. 

24. Count the number of partial order relations on X = {1, 2,3}. 

25. For a fixed set X, let SP be the set of all set partitions of X. We introduce a 
partial ordering on SP as follows: given P,Q € SP, let P < Q mean that every 
block of P is contained in a block of Q; i.e., VS € P,AT € Q,S CT. (a) Prove 
that < is a partial order on SP. (b) Draw the Haase diagram for (SP, <) when 
X = {1,2,3}. (c) Draw the Haase diagram for (SP, <) when X = {a, b,c, d}. 
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6.6 Equivalence Relations and Algebraic Structures (Optional) 


In this chapter, we have defined equivalence relations, equivalence classes, and quotient sets. 
These ideas are very abstract, and you may be wondering how these concepts could be at 
all useful. It turns out that quotient sets can be used to define new algebraic structures 
generalizing familiar number systems such as the integers. In particular, starting with Z 
and the arithmetic operations of integer addition and multiplication, we use quotient sets to 
build new number systems: the integers modulo n (denoted Z/n) and the rational numbers 
(denoted Q). In this section, we investigate how the known arithmetic operations on Z let 
us introduce new arithmetic operations on Z/n and Q. The notion of a well-defined (or 
single-valued) operation plays a key role here. We will see that facts in Z, such as the 
commutative law or the distributive law, can be used to deduce corresponding facts about 
these new number systems. 


The Integers Modulo n 


We have already introduced the integers modulo n in some earlier examples. For convenience, 
we review the relevant definitions here. Fix an integer n > 1. For all a,b € Z, recall a =, b 
means n divides a — b. The relation =, is an equivalence relation on Z, and the equivalence 
class of a € Z is [a]=„ = {a+ kn: k € Z} by Example 6.47. We abbreviate the equivalence 
class notation to [a], or [a], when n is understood. 


6.79. Definition: Integers Modulo n. For each fixed integer n > 1, Z/n is the quotient 
set Z/=,. Thus, Z/n = {[a]p, : a € Z} is the set of all equivalence classes of congruence 
modulo n. 


The next theorem shows that Z/n consists of n distinct equivalence classes. 


6.80. Theorem on the Elements of Z/n. For all n > 1, Z/n = {[0)n, [1Jn,---, [2 — Un}. 
Furthermore, the n equivalence classes [0]n, [L]n, ..., [n — In are all distinct. 


Proof. Fix n > 1. Since Z/n = {[a]n : a € Z}, the set {[O]n, [Ln,---, [2% — 1]n} is a subset of 
Z/n. To establish the reverse set inclusion, fix [a], € Z/n, where a € Z. We must prove there 
exists r € {0,1,...,2—1} with [a], = [r]n. By the Division Theorem, we know a = qn +r 
for some q,r € Z with 0 < r < n. Then n divides a — r = qn, so a =p r, so [aly = [r]n 
by part (a) of the Theorem on Equivalence Classes (§6.3). To see that [0]n, ..., [n — 1]n are 
all distinct, we assume s,t € {0,1,...,n — 1} satisfy |s]n = [t]n, and we prove s = t. Using 
the Theorem on Equivalence Classes again, we see that s =, t, so n divides s — t. Since 
0 < t< n, we see that —n < —t < 0. Adding this inequality to the inequality 0 < s < n, 
we conclude —n < s — t < n. The only integer multiple of n strictly between —n and n is 
zero, so s — t = 0, giving s =t. 


Addition and Multiplication Modulo n 
Now we introduce versions of addition and multiplication defined on integers modulo n. 


6.81. Definition: Addition and Multiplication on Z/n. Fix n > 1. For all a,b € Z, we 
define addition mod n by [a]n ®n [bln = [a+ b]n and multiplication mod n by [a]n @n [b]n = 
Ja - Dn. 


6.82. Example. Take n = 5, so Z/5 = {[0], [1], [2], [3], [4]}. We can make an addition table 
and a multiplication table for arithmetic modulo 5. In these tables, the entry in the row 
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labeled [a], and the column labeled fb]n is [a]n n [b]n (left table) or [a]n n [b]n (right 
table). 

®s | 10) [H] [2] B] [4 5 | 10) n] B] BI Ll 

(Oo) fo] a) B B] [4 [0] f {o} {o} fo] fo] {[0] 

Ga B B 4 6 [1] po a 2) BI [4 

2] | 2] B) [4] [5] [6 [2] | [0] [2] [4] [6] [8] 

[3] B] 4 [5] [6] [7 [3] ) [0] [3] [6] [9] [12] 

[4] | [4] [5) [6] [7 [8 [4] | [0] [4] [8] [12] [16] 
We can get more informative tables by changing the names of the outputs so that every 
entry in the table is one of the standard names [0], [1], [2], [3], or [4]. For example, [3] @5 [4] = 


[7] = [2] and [3] 8s [4] = [12] = [2]. Making these changes produces the following tables: 
®s | [0] [1] 2) [3] [4] @5 | [0] [1] 2) [3] [4] 
[0] | [0] H] R] B] [K [0] | [o] [o] [oO] [o] [0] 
[1] |) R] [B] [4 [0] [1] [0] m] 2) B) A 
[2] | 2] [B] [4] [o] GJ [2] | [0] 2] [4] [1] B] 
[3] | [B] K] [0] [1] B] [8] | [0] [B] [1] [4] B] 
[4] | (4) o] [21] B [B] [4] | [0] 4) B) B E 
We must now discuss a crucial technical point about the definitions of 6, and &n. Just 


as the outputs of these operations have many names, so also do the inputs. For instance, 
when we computed [3] @5 [4] = [12], we chose the particular names [3] and [4] for the 
equivalence classes {3 + 5k : k € Z} and {4 + 5k : k € Z} that are being combined by the 
new addition operation. What happens if we use different names for these two inputs? For 
instance, we know [3] = [8] and [4] = [—6] (mod 5) by the Theorem on Equivalence Classes. 
Using the new names, we would compute [8] ®5 [—6] = [—48]. This may at first seem to 
disagree with our previous answer of [12], until we realize that [—48] = [2] = [12]. However, 
it is not yet clear that we must always get the same output (possibly with a new name) 
whenever we change the names of the two inputs to the operations ©, and pn. 

To resolve this issue in general, we need to address the following question. Suppose 
n > 1 is fixed, and a,a’,b,b’ are integers such that [a], = [a], and [b], = [b']n. Using 
the original names [a], and [b],, we would compute [a]n On [b]n = [a + bln. Using the new 
names [a’],, and [b']n, we would instead compute [a’]n ®n [b']n = [a’ + b']n. Is it always true 
that [a + 6], = [a’ + b']n? The answer is yes, thanks to some previous theorems. By the 
Theorem on Equivalence Classes (§6.3), our assumptions on a,a’,b,b’ mean that a =n a’ 
and b =, b’. Next, by part (b) of the Theorem on Congruence Modulo n (§6.2), we see that 
a+b =n a’ +b’. So the Theorem on Equivalence Classes tells us that [a +b]n = [a’ +0'],. We 
have now proved that changing the names of the inputs to 6, does not change the output 
of the operation (although the name of the output may very well change). More briefly, we 
say that @,, is a well-defined operation. Similarly, to say that ®, is well-defined means that 


Va, b,a’,b' € Z, (laln = la'n A [bln = [0] n) > [ad]n = [a'b]. 


The proof we gave above works here, this time using part (c) of the Theorem on Congruence 
Modulo n. 

We can rephrase our discussion in a more formal way that makes contact with our 
definition of a function. Addition mod n is really a function S : Z/nx Z/n > Z/n such that 
S([a]n, (bln) = [an n [b]n = [a+b]n for all a,b € Z. The graph of this addition function is, by 
definition, the set of ordered pairs G = {(([a]n, [b]n), [a +b]n) : a,b € Z}. To confirm that S 
really is a function, we must check three things (see $5.4): (a) G C (Z/nxZ/n) x Z/n; (b) for 
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all x € Z/nxZ/n, there exists y € Z/n with (x,y) € G; and (c) for all z, a’ € Z/nxZ/n and 
all y,y’ € Z/n, if (x,y) € G and (2’,y’) € G and x = 2’, then y = y’. Part (c) tells us that 
the proposed function S is single-valued: putting equal inputs x and x’ into the function 
must produce equal outputs y and y’. This is exactly what we checked above, taking x to 
be the ordered pair ([a]n,[b]n), x’ to be the ordered pair ([a']n, [b']n), y = [a+ dn, and 
y’ = [a’ + b']n. Conditions (a) and (b) are immediate in this example, though in other 
situations these conditions might need to be checked explicitly. 


Algebraic Properties of Operations Modulo n 


In §2.1, we listed some algebraic properties of the addition and multiplication operations on 
the set of integers. Almost all of these properties extend to the new operations of addition 
and multiplication modulo n on Z/n. Furthermore, now that we know the latter operations 
are well-defined, we can quickly deduce properties for Z/n from the corresponding properties 
for Z. 


6.83. Theorem on Addition and Multiplication Modulo n. For all n > 1, Ax- 
ioms 2.4(a) through (j) in §2.1 (closure, commutativity, associativity, identity, additive in- 
verses, and the distributive law) hold with Z replaced by Z/n, + replaced by ®n, - replaced 
by @n, 0 replaced by [0],,, and 1 replaced by [1]n. 


Proof. We prove a few axioms to illustrate, leaving the others as exercises. To show that 
addition modulo n is commutative, fix two elements of Z/n, say [a], and [b], for some 
a,b € Z. We compute 


Il 


la]n On [b]n [a+ dn (by definition of @,) 
= |b+a]n (since integer addition is commutative) 


= [bln ®n [aln (by definition of n). 


To prove the distributive law, fix three elements of Z/n, say [a]n, [b]n, [c]n where a,b,c € Z. 
We compute 


[a]n Qn ([b]ln Bn [eln) = [aln @n[b+eln (by definition of Pn) 
Jn (by definition of &n) 
(a-b)+(a-c)|n (by the distributive law in Z) 
a- bln On [a-cln (by definition of @,,) 
= (laln @n [b]n) On (laln Qn [eln) (by definition of 8n). 
Assume we have already proved that [0], is the identity element for n. To prove that 


additive inverses exist, fix [a], € Z/n, where a € Z. Note that —a € Z, so |—a]n is an 
element of Zn, and 


[@]n Gn [-a]n = [a + (—a)]n = [0]n. 


Thus, [—a]n is an additive inverse for [a], in Z/n. 


The question of whether Z/n has zero divisors or multiplicative inverses for nonzero 
elements is more subtle. Some of the exercises ask you to show that when n is prime, 
every nonzero element of Z/n has a multiplicative inverse modulo n, and there are no zero 
divisors. On the other hand, when n > 1 is not prime, Z/n has zero divisors and some 
nonzero elements of Z/n do not have multiplicative inverses. 
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Constructing Rational Numbers from Integers 


In most of this book, we have taken the set R of real numbers as our starting point, and we 
have defined the set Q of rational numbers as a particular subset of R (namely, the subset 
of real numbers of the form a/b, where a € Z and b € Zzo). However, it is also possible to 
build the set of rational numbers from the more basic set of integers. We can then introduce 
addition, multiplication, and an ordering relation on Q using the corresponding concepts for 
Z, and prove that these new entities obey the expected axioms. These constructions provide 
an excellent example of an algebraic application of equivalence relations and quotient sets. 

Intuitively, we are going to build Q from Z using the notion of fractions, which we have 
already mentioned above. We want a rational number to be a ratio a/b of two integers a 
and b, where b cannot be zero. The main difficulty is that the division operation for real 
numbers is no longer available, so we cannot interpret a/b as “a divided by b.” We could try 
to get around this by introducing the ordered pair (a, b) as a formal model for the intuitive 
fraction a/b. 

But now a new difficulty arises: there are many ways to represent a particular rational 
number in the form a/b. For example, 3/5 = 6/10 = 9/15 = —3/—5 = ---. However, 
the ordered pairs (3,5), (6,10), etc., are not equal! Now the key idea presents itself: we 
would like to regard all of these ordered pairs as equivalent for the purposes of representing 
fractions. This suggests introducing an equivalence relation on the set of ordered pairs, 
and using the equivalence class of (a,b) as the formal model for the fraction a/b. We want 
(a,b) to be equivalent to (c, d) iff a/b = c/d, but the latter condition involves the forbidden 
notion of division. So we rewrite the condition as ad = bc, which is a statement that only 
involves integers and multiplication of integers. At last, we are ready for our new definition 
of rational numbers. 


6.84. Definition: Rational Numbers. Let X = Z x Zz be the set of ordered pairs 
(a,b) where a,b are integers and b Æ 0. Define an equivalence relation ~ on X by setting 
(a,b) ~ (c,d) iff ad = be. The equivalence class [(a,b)|~ is denoted a/b or ¢. The set X/~ 
of all equivalence classes is denoted Q. Elements of Q are called rational numbers. 

You checked that ~ is an equivalence relation on X in Exercise 14 of 86.2. Look at 
your solution to that exercise and make sure your proof did not use division! For instance, 
here is the verification that ~ is transitive. Fix a,b,c,d,e, f € Z with b,d, f 4 0; assume 
(a,b) ~ (c,d) and (c,d) ~ (e, f); prove (a,b) ~ (e, f). We have assumed ad = bc and 
cf = de; we must prove af = be. Multiplying the assumptions, we get (ad)(cf) = (bc)(de), 
or (afc)d = (bec)d since integer multiplication is commutative and associative. We can 
rewrite this as (afc — bec)d = 0 by the distributive law. Since d 4 0 and Z has no zero 
divisors, we conclude afc—bec = 0, or (af)c = (be)c. If c is nonzero, we can similarly deduce 
that af = be, as needed. If c = 0, we have ad = bc = 0 = cf = de. Since d is nonzero, we 
see that a = e = 0, so af = 0 = be, as needed. 

Let a,b,c,d € Z with b,d Æ 0. By the Theorem on Equivalence Classes and the def- 
inition of ~, [(a,b)|~ = [(c,d)]~ iff (a,b) ~ (c,d) iff ad = bc. We can now officially say 
that | a/b = c/d | iff | ad = bc |, provided we remember that the expressions a/b and c/d are 


abbreviations for the equivalence classes [(a, b)]~ and [(c, d)]~. 


Addition and Multiplication of Rational Numbers 


The next step in the formal development of Q is to introduce the operations of addition 
and multiplication for rational numbers. As in the case of Z/n, the key initial point is to 
verify that these operations are well-defined, not depending on which names we use for the 
inputs to the operations. 
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6.85. Definition: Addition and Multiplication in Q. Given rational numbers a/b and 


a c ad+bc a c ac 
d d fi } = d . — Pa 

SMES Chg 5 tg ee 

Using the more cumbersome equivalence class notation for fractions, this definition says 

that [(a, b)] + [(c, d)] = [(ad + be, bd)] and [(a, b)] - [(¢, d)] = [(ae, bd)]. 

6.86. Lemma. The operations + and - on Q are well-defined. 

Proof. Fix a,b,c, d,a’,b',c’,d’ € Z with b,d,b',d' £0. Assume a/b = a'/b' and c/d = d /d', 

which means ab’ = ba’ and cd’ = dc’. To show that addition is well-defined, we prove that 

a/b + c/d = a'/b' + d /d'. Expanding the definition, we must show that (ad + bc)/(bd) = 

(a'd! +b'c')/(b'd'), which means (ad + bc)b'd’ = bd(a'd’ + b'c’). Working in Z, we compute 

(ad + bc)b'd' = adb'd' + bcb'd' = ab’ dd! + cd'bb' = ba'dd' + de'bb! = bd(a'd' + b'c’), 

as needed. To show that multiplication is well-defined, we prove that (a/b): (c/d) = (a’/b’)- 

(c'/d’), which means (ac)/(bd) = (a'c')/(b'd'). Here we must prove acb'd’ = bda'c'. We 

compute acb'd' = (ab’)(cd’) = (ba’)(dc’) = bda'c', as needed. 


Now we can prove that addition and multiplication of rational numbers satisfies axioms 
similar to those in Item 2.4, and moreover every nonzero rational number has a multiplicative 
inverse. 


6.87. Theorem on Algebraic Operations in Q. 
(a) Closure: For alla,y€E€Q,r+yEeQandaz-yEQ. 


(b) Commutativity: For allz,y€Q,a+y=yteuanda-y=y-u. 

(c) Associativity: For all x,y,z E€ Q, (x +y) +2 =x + (y +z) and (x-y): z =x- (y: z). 
(d) Identity: For all x € Q, x + (0/1) =a and a- (1/1) = z. 

(e) Additive Inverses: Vx € Q, Jy € Q, x +y = 0/1. 
(£) 
( 


f) Multiplicative Inverses: Vx € Q, if x # 0/1, then Jy € Q, z - y = 1/1. 
g) No Zero Divisors: For all x,y € Q, if x - y = 0/1, then z = 0/1 or y = 0/1. 


Proof. We prove a few parts as illustrations, leaving the others as exercises. First we prove 
that addition is commutative. Fix x,y € Q, and write x = a/b and y = c/d for some 
a,b,c,d € Z with b, d # 0. Note that x+y = (a/b) + (c/d) = (ad +bc)/ (bd), whereas y +£ = 
(c/d) + (a/b) = (cb + da)/(db). Since addition and multiplication in Z are commutative, 
ad + bc = cb + da and bd = db, so x + y = y + x. Next we prove that multiplication is 
associative. Fix x,y,z € Q, and write x = a/b, y = c/d, z = e/f for some a,b,c,d,e, f E Z 


with b, d, f # 0. On one hand, (z: y): 2 = 1$ = ae On the other hand, x - (y - z) = 


` af = bay: Since multiplication in Z is associative, (x-y)-z=2-(y-2z) follows. To check 
the additive identity axiom, fix x € Q, and write x = a/b for some a,b € Z with b Æ 0. 
Compute z+ 0/1 = a/b+0/1 = (a-1+06-0)/(b- 1) = a/b = z. To check the multiplicative 
inverse axiom, fix x € Q with z # 0/1. Write x = a/b with a,b € Z and b ¥ 0. Since 
a/b #0/1,a-14 6-0, and hence a £ 0. Thus, y = b/a is a member of Q, and we compute 
x- y = (a/b) - (b/a) = (ab)/(ba). The fraction (ab)/(ba) equals 1/1 because (ab)1 = (ba)1. 
Thus, zy = 1/1, as needed. 


a ce a(ce) 


We usually write 0 instead of 0/1 and 1 instead of 1/1. More generally, for any integer 
n, we often identify n with the rational number n/1. You can check that for all n,m € Z, 
n/1+m/1 = (n+m)/1; (n/1)-(m/1) = (nm)/1; and if n 4 m, then n/1 4 m/1. So the 
identification of n with n/1 preserves equality, addition, and multiplication of integers. This 
identification also allows us to view Z as a subset of Q. 
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Section Summary 


Í; 


Integers Modulo n. For n € Z»o, Z/n consists of the n distinct equivalence classes 
[O]n; [L]n, .--, [n— 1]n of the equivalence relation =,,. For all a,b € Z, [a]n = [b]n iff 
a =n b iff n divides a — b. The operations [a]n Sn [b]n = la +b]n and [a]n 8n [bln = 
[ab], are well-defined and satisfy closure, commutativity, associativity, identity, 
distributive laws, and existence of additive inverses. 


Well-Defined Operations. To see that an operation defined on equivalence classes 
is well-defined, one must check that changing the names of the inputs to the oper- 
ations does not change the value of the output (although the name of the output 
may change). For example, saying that @,, is well-defined means Va, b, a’, b € Z, 
if [a], = [a']n and [b]n = [b']n, then [a + b], = [a’ + Bn. 

Rational Numbers. Formally, Q = X/~, where X = Z x Zyo and (a,b) ~ (c,d) 
iff ad = bc (where a,b,c,d € Z and b,d # 0). The equivalence class [(a, b)]~ is 
denoted a/b or ¢. The operations a/b + c/d + (ad + be)/(bd) and (a/b) - (c/d) = 
(ac)/(bd) are well-defined and satisfy closure, commutativity, associativity, iden- 
tity, inverse, and distributive laws. 


Exercises 


1. 


10. 


11. 


12- 


Make an addition table and multiplication table for Z/6. Does Z/6 have zero 
divisors? 

Make an addition table and multiplication table for Z/7. Does Z/7 have zero 
divisors? 

Fix n € Zyo, and define f : Z/n > Z/n by f([a]n) = [-a]n for all a € Z. Prove 
that f is well-defined. 


Prove the remaining axioms for addition in Theorem 6.83. 

Prove the remaining axioms for multiplication in Theorem 6.83. 

(a) Find all x € Z/12 for which there exist y € Z/12 with x 12y = 1. (b) Repeat 
(a) for Z/13. (c) Repeat (a) for Z/16. 

Suppose p is prime. (a) Prove: for all x € Z/p, if x # [0], then there exists y € Z/p 
with x Qp y = [1]p. [Hint: What is ged(x, p)?] (b) Prove Z/p has no zero divisors, 
i.e., for all x,y € Z/p, if £ 8p y = [0]p, then x = [0], or y = [0]p. 

Suppose n > 1 is not prime. (a) Prove there exist nonzero z, y E€ Z/n with c@ny = 
[0]n. (b) Prove that some nonzero elements of Z/n do not have multiplicative 
inverses. 

Prove the remaining parts of Theorem 6.87. 

Define i : Z > Q by i(m) = m/1 for m € Z. Prove that i is one-to-one and for 
all m,n € Z, i(m +n) = i(m) + i(n) and i(mn) = i(m)i(n). [The function i lets 
us identify Z with a subset of Q.] 

Which of the following functions are well-defined? Prove your answers. 

a) f: Q> Z given by f(m/n) =m for m,n € Z with n £ 0. 

b) g : Q4o > Q given by g(m/n) = n/m for m,n € Zyo. 

c) h : Q > Q given by h(m/n) = 3m/n for m,n € Z with n ¥ 0. 

d) k : Q > Q given by k(m/n) = (m + 5)/n for m,n € Z with n £0. 

a) Prove: for all a,b,c € Z with b,c 4 0, a/b = (ac)/(bc). (b) Show that for 
every x € Q, there exists a unique pair (a,b) € Z? with b > 0, gcd(a, b) = 1, and 


( 
( 
( 
( 
( 
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x = a/b. (Informally, the particular name a/b for x is called the fraction in lowest 
terms representing zx.) 


13. Ordering on Q. Given x,y E€ Q, we know x = a/b and y = c/d for some a,b,c,d € 
Z with b,d € Z>o (see the previous exercise). Define a relation < on Q by setting 
x < yiff ad < be. (a) Prove that this relation is well-defined. (b) Prove that < is a 
total ordering of Q. Also prove that for all m,n € Z, m < n (in Z) iff m/1 < n/1 
(in Q). 

14. Fix n € Zs,. Suppose we try to define an ordering relation < on Z/n as follows: 
for all a,b € Z, let [a], < [b], iff a < b. Give a precise explanation of why this 
construction fails. 


15. Let X be a fixed set. Define a relation ~ on X x X as follows: for a,b,c,d € X, 
let (a,b) ~ (c,d) iff (a=cAb=d)V (a=dAb=c). 
(a) Prove ~ is an equivalence relation on X x X. 
(b) Let Y be the set of unordered pairs {a,b} with a,b € X. Define f : X/~x> Y 
by f({(a, b)|~) = {a,b}. Prove that f is a well-defined bijection. 


16. Let R be a reflexive, transitive relation on a set X. (a) Define a relation ~ on 
X as follows: for all a,b € X, let a ~ b iff aRb and bRa. Prove that ~ is an 
equivalence relation. (b) Define a relation < on the quotient set X/~ as follows: 
for all a,b € X, [a]~ < [b]~ iff ab. Prove that < is well-defined and (X/~, <) 
is a poset. (c) Discuss the effect of the preceding construction when X = Z and 
R= {(a,b) € Z? : ald}. 

17. This problem outlines a formal construction of Z (the set of all integers) from Z>o 
(the set of nonnegative integers). When solving this problem, take care not to use 
subtraction or additive inverses. Assume we know: Vz, y E€ Z59,e +1 =y+1> 
x+y. (a) Prove by induction on n: Vz,y,n E€ Z>0,£ +n = y+n => z = y. (b) Let 
X = Z>o Xx Z>0, and define a relation ~ on X by (a,b) ~ (c,d) if a+d=b+c 
for a,b,c,d € Z>o. Prove that ~ is an equivalence relation on X. Denote the 
equivalence class [(a,b)] by [a — b], and let Z be the set of all such equivalence 
classes. (c) For a,b,c,d € Z>o, define [a — b] + [c — d] = [(a + c) — (b + d)] and 
[a—b]-[c—d] = [(ac+bd) —(ad+bc)]. Prove that these operations are well-defined. 
(d) Check the axioms in 2.4, assuming the corresponding facts for Z>9 are known. 
(e) Show: for every x € Z, there exists a unique a € Zso such that x = [a — 0] or 
x = [0-a]. 

18. Let f : X > Y be a function, let Z = f[X] be the image of f, and let R be the 
equivalence relation ~f. Show that g : X/R > Z, given by g({a]lr) = f(a) for 
a € X, is well-defined and bijective. 


7 


Cardinality 


7.1 Finite Sets 


The theory of cardinality gives us a rigorous way to measure and compare the size of various 
sets (finite or infinite). The key idea is that two sets X and Y have the same size (cardinality) 
iff there exists a bijection from X to Y. This definition has many surprising consequences 
for infinite sets that may challenge your intuition. For example, not all infinite sets have the 
same size. We will see that, in a sense to be made precise later, R is a strictly larger infinite 
set than Q, and P(R) (the set of all subsets of R) is larger than R. On the other hand, 
Zso and Z and Q all have the same cardinality, even though we have strict set inclusions 
Z>o GZ Q Q. We begin our study of cardinality by defining finite sets and proving their 
basic properties. Our theorems about finite sets should agree with your intuition and may 
even seem to be obvious, but some of these results are surprisingly tricky to prove carefully. 


Definition of Finite Sets 


In ordinary life, we count a set of objects by pointing to each object in the set and saying 
“one, two, three, ....” This verbal procedure is essentially setting up a one-to-one corre- 
spondence (bijection) from some set of integers {1,2,...,n} onto the given set, as shown 
in Figure 7.1. We formalize this idea in the next definition. Here and below, we use the 
notation {1,2,...,n} to denote the set {1€ Z:1<i<n}. 


f 


FIGURE 7.1 
Counting a set of objects. 


7.1. Definition: n-Element Sets. Let X be a set. For all n € Zyo,|X has n elements 
iff | there exists a bijection f : {1,2,...,n}— X |. We say | X has 0 elements | iff} X = 0. 


For n € Z>0, | |X| = n | means that X has n elements. 


Since a function f is a bijection iff f7! exists and is a bijection (see §5.9), we can also 
say that X has n elements iff there is a bijection g : X —> {1,2,...,n}. 
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7.2. Definition: Finite and Infinite Sets. For all sets X, 
X is finite | iff | In € Z>0, |X| =n |. | X is infinite | iff | X is not finite |. 


Properties of Finite Sets 


The next theorem lists the main properties of finite sets. 


7.3. Theorem on Finite Sets. For all sets X, Y, X1,...,Xs (where s € Z>o): 

(a) Cardinality is Well-Defined: For all n,m € Zso, if |X| = n and |X| = m, then n = m. 
(b) Subset Property: If |X| = n and Y C X, then dm € Zso, |Y| =m and m < n. 

(c) Sum Rule: If |X| =n and |Y| = k and X AY = f, then |X UY|=n+hk. 


(d) Sum Rule for s Sets: If |X;| = ni for 1 < i < s and Xi N X; =0 for all i Æ j, 
then |X U X2 U- U X| = nı +n +: + ns. 


(e) General Sum Rule: If |X| = n, |Y | = k, and |X N Y| = i, then |XXUY|=n+k-—i. 


(£) Product Rule: If |X| = n and |Y| = k, then |X x Y| = nk. 

(g) Product Rule for s Sets: If |X;| = ni for 1 < i < s, then | X1 x X2 X- -X Xs| = Nina- Ns. 
(h) Power Set Rule: If |X| = n, then |P(X)| = 2”. 

(i) Finite Set Operations Preserve Finiteness: If Xı,..., X. are all finite, then U}; Xi, 


Ni Xi, X1 x +++ x Xs, and P(X) are also finite. Every subset of a finite set is finite. 


The first two parts are examples of intuitively plausible properties that are not so 
straightforward to prove rigorously. We give proofs of these results in an optional subsection 
later. The remaining items are proved below. Throughout this chapter, we frequently use 
the following terminology. Sets X and Y are called disjoint iff X NY = Q; and a list of sets 
X1,...,Xs5 is pairwise disjoint iff X; O X; =0 for alli # j with 1 < i,j < s. 


The Sum Rule 


Let us now prove the Sum Rule (part (c) of Theorem 7.3). Assume X and Y are disjoint 
sets with |X| = n and |Y| = k; we must prove |X U Y| = n + k. Our assumption means 
there exist bijections f : {1,2,...,n} > X and g : {1,2,...,k} — Y; we need to build 
a bijection h : {1,2,...,n +k} + X UY. To do this, define h(i) = f(i) for 1 < i < n, 
and define h(i) = gli — n) for n+1 < i < n + k (where i denotes an integer). Figure 7.2 
illustrates the construction of h when n = 3 and k = 2. We prove h is a bijection by 
exhibiting a two-sided inverse h’: X UY > {1,2,...,n +k}. Each z € X UY is in exactly 
one of the sets X or Y, since X AY = b. If z € X, define h' (z) = f—1(z). If z € Y, define 
h'(z) = n + g`! (z). You can now prove that h and h’ are well-defined functions such that 
hoh' = Idyuy and K’ o h = Idṣ1 2... n+}. AS an example of what needs to be checked, let 
us see that h(h’(z)) = z when z € Y. By definition of h’, we know h’(z) = n+g71(z). Since 
g7! maps Y into the set {1,2,...,k}, we see that n +1 < h’(z) < n + k. Hence, applying 
the definition of h, we get h(h'(z)) = g(h' (z) — n) = g(g7'(z)) = z = Idxuy(z) = z, as 
needed. 

We can now prove the Sum Rule for s Sets (part (d) of the theorem) by induction on 
s. The result is immediate if s = 1, and it holds by what we just proved if s = 2. For the 
induction step, fix s E€ Z>2, assume the Sum Rule is known for any union of s pairwise 
disjoint sets, and prove this rule for a union of s + 1 pairwise disjoint sets. Fix arbitrary 
sets X1,...,X5,Xs41 with |X;| = n; for all i and X; N X; = Ø for all i # j. The idea is to 
apply the Sum Rule already proved (part (c) of the theorem) to the sets X = X,U---UX, 
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FIGURE 7.2 
Building h from f and g. 


and Y = Xs+1. The key observation is that 


XAY = (úx) N Xs = GN X) = LO =F. 
i=1 


i=l {=l 


So we can invoke the Sum Rule to conclude that |X U Y| = |X| + |Y|. Now, since the 
sets X1,..., Xs are pairwise disjoint, the induction hypothesis applies to show that |X| = 
nı +: +ns. Since |Y | = |Xs+1| = ns+1, we conclude that |X UY| = (ni +---+ns5)+ns41, 
completing the induction step. 

Finally, we prove the General Sum Rule (part (e) of the theorem) Fix sets X and 
Y with |X| = n, |Y| = k, and |X A Y| = i. On one hand, X = (X —Y)U(X NY) 
where the two sets X — Y and X NY have empty intersection. By the Sum Rule already 
proved, |X| = |X — Y| + |X MY]. On the other hand, X UY = (X — Y)UY where 
the two sets X — Y and Y have empty intersection. Using the Sum Rule again, we get 
|X UY| = |X — Y| + |Y|. Inserting the previous equation into this one, we find that 
IX UY| = |X|+|/Y|-—|XNY|=n+k—i, as needed. This proof tacitly used the fact that 
X —Y isa finite set (being a subset of the finite set X; see part (b) of the theorem), so 
that |X — Y| = j for some integer j. Similarly, all of our counting formulas make implicit 
use of the fact that the size of a finite set is well-defined (part (a) in Theorem 7.3). 


The Product Rule and Power Set Rule 


Before proving the Product Rule (parts (f) and (g) of Theorem 7.3), we need a preliminary 
lemma. We prove that for any n-element set X and any object b, |X x {b}| =n. Fix X and 
b with |X| = n. We know there exists a bijection f : {1,2,...,n} — X, and we must show 
there is a bijection g : {1,2,...,n} — X x {b}. To do this, define a function h : X — X x {b} 
by h(x) = (a, 6) for all x € X. We see at once that h is a bijection with two-sided inverse 
given by h7!(x,b) = x for (a,b) € X x {b}. We may now choose g = ho f, which is a 
bijection since it is the composition of two bijections. 

Now we prove the Product Rule for two sets. Fix sets X and Y with |X| = n and 
|Y| = k. We know there is a bijection y : {1,2,...,k} — Y. The idea is to slice up the 
product set X x Y into k pairwise disjoint sets X; = X x {y(i)} for 1 < i < k; see Figure 7.3 
for an example where k = 3, X = {1,2,3,4}, and Y = {1,2,4}. To see that the sets X; 
are pairwise disjoint, fix i 4 j between 1 and k. To get a contradiction, assume there is an 
ordered pair (a,b) € X; N Xj. Then y(i) = b = y(j), forcing i = j since y is one-to-one. 
This contradicts i Æ j. Next, we claim X x Y = Xı U X2U---U Xx. On one hand, fix 
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FIGURE 7.3 
Decomposing X x Y into disjoint sets X1,..., Xx. 


(a,b) € X xY. Then a € X and b = y(t) for some i, because y is onto. So (a,b) € X; for 
some i, hence (a,b) € Gian Xi. Thus X xY C es Xi; we let the reader check the reverse 
set inclusion. By the lemma, |X;| = |X| = n for 1 < i < k. Since X x Y is the union of the 
pairwise disjoint sets X1,...,X,, the Sum Rule gives 


|X x Y| = |X| ++ |X| =n +---+n (k summands) = nk. 


We can now prove the Product Rule for s Sets by induction on s, using the case just proved 
(s = 2) to complete the induction step. This proof is very similar to what we did in the 
proof of the Sum Rule for s Sets, so we leave this as an exercise. 


7.4. Example. For n € Zo, how many strings of zeroes and ones of length n are there? 
For example, when n = 3, there are eight such strings: 000, 001, 010, 011, 100, 101, 110, and 
111. The map sending a binary string a1a2---a, to the ordered n-tuple (a1, a2,..., an) isa 
bijection from the set of such strings to the product set {0,1}”. You can check that {0,1} 
is a two-element set. By the Product Rule, |{0,1}”"| = 2”. Thus, there exists a bijection 
f:{1,2,...,2"} > {0,1}. 


Aided by the preceding example, we can now prove the Power Set Rule. Fix an n-element 
set X. We can choose a bijection x: {1,2,...,n} > X so that X = {x(1),x(2),...,a(n)}. 
Define a function g : {0,1}" —> P(X) as follows. Given (a1,...,an) € {0,1}", let S = 
g(@1,...,@n) be the set of all x(i) such that a; = 1. In other words, if a; = 0, then (i) is 
not a member of S, and if a; = 1, then x(¢) isa member of S. Next define g' : P(X) — {0,1}” 
as follows. Given T C X, let g'(T) = (b1,..., bn), where b; = 1 if z(i) € T, and b; = 0 if 
a(t) ¢ T. We ask the reader to check that g and g’ are well-defined functions such that gog’ 
and g'o g are identity maps. So g is a bijection. Composing g with the bijection f from the 
example, we get a bijection from {1,2,...,2”} onto P(X). Thus, |P(X)| = 2” as needed. 

To illustrate this proof, suppose n = 3, X = {a,b,c}, and z(1) =a, x(2) = b, z(3) = c. 
We find that g(1,0,1) = {x(1),2(3)} = {a,c}, g(0,1,0) = {a(2)} = {b}, and g(0,0,0) = 
{} = 0. Going the other way, g'({b,c}) = 9'({2(2),2(3)}) = (0,1, 1), gah) = d fe()}) = 
(1,0,0), and so on. 


Preservation of Finiteness 


Finally, we use the preceding results to prove part (i) of Theorem 7.3. Informally, this item 
says that doing finitely many set operations on finite sets always produces another finite 
set. Fix s € Z>o and finite sets X1,...,X, with |X;| = n; for each i. By the Product Rule, 
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|X1 X-X X| = Nina -+ Nns, 80 X1 X- -X Xs is finite. By the Power Set Rule, |P(X1)| = 2™, 
so P(X}) is finite. Item 2 of the theorem immediately implies that any subset of any finite 
set is finite. Since ge X; is a subset of the finite set X4, this intersection is a finite set. 

Proving finiteness of the union of the X; is a bit more complicated, since the Sum Rule 
for s Sets required that the given sets be pairwise disjoint. Here we use induction on s. If 
s=1, Ui Xi = Xı, and X, was assumed to be finite. Now fix s > 1, assume that the 
union of any s finite sets is finite, and prove that the union of any s + 1 finite sets is finite. 
Fix finite sets X1, ..., Xs, Xs+1. By the induction hypothesis, X = X, U +- U X, is finite, 
say |X| = n. By assumption, Xs+; is also finite, say |Xs+1| = k. The intersection X N Xs+1 
is finite, being a subset of the finite set Xs+1; say |X N Xs+1| = i. Now the General Sum 
Rule (part (e) in the theorem) tells us that |X U Xs+1| = n+ k — i, so Ua Xi = XUXs+ı 
is finite. 


Two Technical Proofs (Optional) 


We now prove part (a) of the Theorem on Finite Sets, which says that the size of a finite 
set is unique. We use induction on n to prove this statement: 


Yn, m € Z>0, YX, (|X| =n A |X| =m) >n=m. 


We prove the base case (n = 0) by contradiction. Assume, to get a contradiction, that there 
is a set X and there exists m € Zso with |X| = 0 and |X| = m and 0 # m. On one hand, 
|X| = 0 means X = @ by definition. On the other hand, |X| = m > 0 means there is a 
bijection f : {1,2,...,m}— X. Then f(1) € X contradicts X = 0. 

For the induction step, fix n € Zso, and assume Ym € Z>0,YX, (|X| = n ^A |X| = 
m) > n = m. We must prove Vp € Z>0,VY, ([Y| = n+1A|Y| = p > n+1 = p. Fix 
p E€ Z>o, fix a set Y, assume |Y| = n + 1 and |Y| = p, and prove n + 1 = p. Since 
n+ 1 > 0, we cannot have p = 0; otherwise, the argument in the previous paragraph leads 
to a contradiction. Thus, we have p = m + 1 for some m € Zo. By assumption on Y, there 
are bijections f : {1,2,...,n+1}— Y and g : {1,2,...,m+1} > Y. Let yo = f(n+1)E€ Y 
and z9 = g(m + 1) € Y. Suppose first that zo = yo. You can check that the restrictions 
F = filgiy,...ny and g' = glt1,2,...,:m} are both bijections onto the set X = Y — {yo}. Then 
|X| =n and |X| = m, so the induction hypothesis applies to show that n = m, and hence 
n+1 =m +1 = p as needed. 

If zo yo, we reduce to the case just considered as follows. Define h : Y > Y by setting 
h(zo) = yo, h(yo) = z0, and h(y) = y for all y € Y—{yo, zo}. You can verify that hoh = Idy, 
so h is a bijection with h~! = h. The composition ho g is a bijection from {1,2,...,m +1} 
to Y such that (ho g)\(m + 1) = h(g(m + 1)) = h(zo) = yo. Now we finish as before: the 
restriction of f to {1,2,...,n} and the restriction of hog to {1,2,...,m} are bijections 
from their respective domains onto X = Y — {yo}, so |X| = n and |X| = m, so n = m by 
induction hypothesis, so n + 1 = m + 1 = p. This completes the proof of part (a). 

We now use induction on n to prove part (b) of the Theorem on Finite Sets. We must 
show: Yn € Z>0, YX, VY, (|X| =n AY C X) > (Im € Zso, |Y| = mAm < n). For the base 
case, take n = 0, fix sets X and Y, and assume |X| = 0 and Y C X. By definition, |X| = 0 
means X = . We assumed Y C Ø, and we know C Y, so Y = Ø. Thus Y = 0 and |Y| = 0. 
So we may choose m = 0 in the statement to be proved. 

For the induction step, fix n € Z>9, and assume 


VX' VY", (X| =n AY’ CX’) > (Im € Z>0,| Y| =mAm<n). 
We must prove 


VX, VY, (|X| =n +1 AY C X) > (Ap € Zs0, |Y| =p^p<n+1). 
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Fix sets X and Y, assume |X| = n + 1 and Y C X, and prove dp € Z>0,|Y| =p <n+1. 
Choose a bijection f : {1,2,...,2 +1} —> X, define zo = f(n +1) € X, and note f’ = 
F\¢1,2,....n} is a bijection from {1,2,...,n} onto X’ = X — {xo}. Thus, |X’| = n. We know 
zo € Y or xo € Y, so consider two cases. Case 1. Assume zo ¢ Y. Since we also assumed 
Y C X, we see that Y C X’ in this case. We can apply the induction hypothesis taking 
Y’ = Y to see that for some nonnegative integer m < n, |Y| = m. Choosing p = m, we have 
|Y| =p <n+1 as needed. Case 2. Assume xo € Y. In this case, we take Y’ = Y — {ao} 
in the induction hypothesis, which is allowed since Y’ is a subset of X’. This leads to the 
existence of m < n with |Y’| = m, which means there is a bijection g : {1,2,..., m} > Y”. 
(If m = 0, g : @— @ is the empty function.) Define a function h: {1,2,..., m +1} > Y by 
setting h(i) = g(t) for 1 < i < m, and h(m + 1) = zo. We ask the reader to check that h is 
a bijection. Thus, |Y| = m+1 < n+ 1, so that the required existence statement holds with 
p=m-+1. This completes the proof of part (b). 


Section Summary 


1. Finite Sets. A set X has n elements iff there is a bijection f : {1,2,...,n} > X, 
in which case we write |X| = n. A set X is finite iff dn € Z>0, |X| =n. The size 
n is uniquely determined by X. A set X is infinite iff X is not finite. 


2. Sum Rule. Given a finite list of pairwise disjoint finite sets X1,..., Xs, 
|X, U XqU---UX.| = |X| + | Xo] +--+ [Xs]. 


Given any two finite sets X and Y, |X UY| =|X|+|Y|-|X NY]. 
3. Product Rule. Given a finite list of finite sets X1,..., Xs, 


|Xy x X2 x +++ x Xs] = [[ 1X: 
j=] 


4. Power Set Rule. Given an n-element set X, |P(X)| = 2”. 
5. Preservation of Finiteness. The union, intersection, and product of finitely many 


finite sets are finite sets. Each subset of a finite set is finite. The power set of a 
finite set is finite. 


Exercises 


1. (a) Prove: for all n € Zyo, |{1,2,... n} =n. 
(b) Prove: for all n € Z>o, {0,1,...,n — 1} has n elements. 
(c) Prove: for all n € Z>o, {—1,—2,...,—n} has size n. 

2. (a) Prove that X = {1,3,5,7,9} has 5 elements. (b) Prove that Y = {2,4,6,8} 
has 4 elements. (c) Use (a), (b), and the proof of the Sum Rule to build a bijection 
h: {1,2,...,9} > XUY. 

3. (a) Prove: for all sets X and Y, if X is finite, then X — Y is finite. (b) Prove: for 
all sets X and Y, if X — Y and Y are finite, then X is finite. 


4. Prove: for all sets X and Y, if Y C X and Y is infinite, then X is infinite. 


5. Verify these details in the proof of the Sum Rule: (a) h is a well-defined function 
mapping {1,2,...,n+k} into X UY. (b) h’ is a well-defined function mapping 
XUY into {1,2,...,n +k}. (c) hoh! =Idxuy. (d) h'o h = Idg,2,.. ntk} 


Cardinality 


10. 


11. 


12- 


13. 


14. 


15. 


16. 


17. 
18. 
19; 


Prove the set inclusion UŁ: Xi C X x Y in the proof of the Product Rule. 


Prove the general Product Rule (part (g) in the Theorem on Finite Sets) by 
induction on s. 


Finish the proof of the Power Set Rule by checking that g and g’ are well-defined 
functions such that go g’ and g' o g are identity maps. 


(a) Draw the arrow diagram for a bijection f : {1,2,...,8} — {0,1}%. (b) Use 
(a) and the proof of the Power Set Rule to draw an arrow diagram for a bijection 
from {1,2,...,8} onto P({a,b,c}). 


Prove carefully that the restriction of a bijection f : {1,2,...,n +1} > Y to 
{1,2,... n} is a bijection from {1,2,...,n} onto Y — {f(n + 1)}. 


Prove: for all finite sets X, Y, Z, 


IXUYUZ|=|X|+]|Y|4+|Z|-|XNY|-|XN Z| -|YNZ|4+|XnYN ZI. 


Given c,d € Zyo, define f : {0,1,...,c—1} x {0,1,...,d—1} > {0,1,...,cd— 1} 
by f(i,j) = di+ j. Aided by the Division Theorem, prove that f is injective and 
surjective. 


Use the previous problem to give a new proof of part (f) in the Theorem on Finite 
Sets. 


Prove carefully that Zs» is infinite. (Suggestion: One approach is to show by 
induction on n that for all n € Z>0, Z>o does not have size n.) 


Prove: for all sets X and Y and all n € Zso, if |Y| = n and there exists an 
injection g : X > Y, then |X| = m for some m < n. 


Prove: for all sets X and Y and all n € Zso, if |X| = n and there exists a 
surjection f : X > Y, then |Y | = m for some m < n. 


Prove: for all sets X and Y, if X is finite and Y C X and |X| = |Y |, then X = Y. 
Prove: for all finite sets X and all f : X —> X, f is injective iff f is surjective. 


(a) Prove that every nonempty finite totally ordered set has a least element. 
(b) Prove that every finite totally ordered set is well-ordered. 
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7.2 Countably Infinite Sets 


Now that we understand finite sets, the next step is to examine countably infinite sets, 
which are the smallest infinite sets. Informally, a set X is countably infinite iff it is possible 
to list all the elements of X (with no repetitions) in a sequence a1, @2,...,@n,..-, where 
the terms of the sequence are indexed by positive integers. Formally, this means that there 
exists a bijection from Zs onto X. We will see that the following sets are all countably 
infinite: Z>0, Z>0, Z, Q, Z*, and Q! (where k is any positive integer). More generally, the 
product of finitely many countably infinite sets is still countably infinite. 


Cardinality Definitions 
The following definition allows us to compare the size (cardinality) of two arbitrary sets. 


7.5. Definition: Cardinality. For two sets A and B: 


(a) | |A| = |B| | means | there exists a bijection f : A > B |. 


(b) | |A| < |B] | means | there exists an injective function g: A > B|. 


When A is finite with n elements, the symbol |A| denotes the nonnegative integer n. For 
an infinite set A, we do not attempt to assign a meaning to the symbol |A| by itself; only 
the longer expressions “|A| = |B|” and “|A| < |B|” have been defined here. We now show 
that these concepts obey some expected rules of equality and order. 

7.6. Theorem on Cardinal Equality and Order. For all sets A, B, C, A’, and B’: 
(a) Reflexivity: |A| = |A]. 

b) Symmetry: If |A| = |B|, then |B| = |A]. 

c) Transitivity: If |A| = |B| and |B| = |C], then |A| = |C]. 

d) Reflexivity of Order: |A| < |A]. 

e) Antisymmetry of Order: If |A| < |B| and |B| < |A], then |A| = |B}. 

f) Transitivity of Order: If |A| < |B| and |B| < |C], then |A] < |C]. 
) 


g) Disjoint Union Property: If |A| = |A'| and |B| = |B'| and AN B = @ = A' N B’, then 
|AU B| = |A’U B'|. 


(h) Product Property: If |A| =|A’| and |B| = |B'|, then |A x B| =|A’ x B'I. 
(i) Power Set Property: If |A| = |A'|, then |P(A)| = |P(A’)]. 


( 
( 
( 
( 
( 
( 


Proof. Fix sets A, B, C, A’, and B’. To prove |A| = |A|, we must find a bijection from A 
to A. The identity map Id4 : A > A is such a bijection. For part (b), assume |A| = |B| 
and prove |B| = |A|. We assumed there is a bijection f : A > B. Then f-': Ba A 
is a bijection, so |B| = |A|. For part (c), assume |A| = |B| and |B| = |C|, so there exist 
bijections f : A > B and g : B > C. The composition g o f is a bijection from A to C, 
so |A| = |C|. Parts (d) and (f) are proved similarly, replacing bijections with injections. 
But part (e) is much more difficult to prove. This is a famous result called the Schröder- 
Bernstein Theorem, which we prove in §7.3. 

For the remaining properties, assume |A| = |.A’| and |B| = |B’|, which means there are 
bijections f : A > A’ and g : B > B’. To prove part (h), define h : A x B > A’ x B’ by 
h(a,b) = (f(a), g(b)) for (a,b) € A x B, and define h’: A’ x B' > Ax B by h'(a’,b’) = 
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(f-t(a’'),g~1(b')) for (a',b') € A’ x BY’. It is routine to check that h'o h = Idaxg and 
ho R =Idxxp, so h and K are bijections. Thus, |A x B| = |A’ x B’| as needed. To prove 
part (g), assume AN B= = A N B’. Define p: AU B —> A’ U B’ by letting p(x) = f(z) 
if x € A, and p(x) = g(x) if x € B. Then p is a bijection because it has a two-sided inverse 
p' : A'UB' —> AUB defined by letting p'(y) = f-'(y) if y € A’, and p'(y) = g7! (y) if 
y € B’. You are asked to prove part (i) as an exercise. 


Countably Infinite Sets 


In Exercise 14 of §7.1, you were asked to prove that Zyo is an infinite set. The next definition 
introduces countably infinite sets, which are sets with the same cardinality as Z>o. 


7.7. Definition: Countably Infinite Sets. For all sets X: 


X is countably infinite | iff | there exists a bijection f : Z>o > X |. 


Note that X is countably infinite iff |Z>o| = |X|. For all sets X and Y, if |X| = |Y| 
then X is countably infinite iff Y is countably infinite. This follows from symmetry and 
transitivity of cardinal equality. The next few propositions provide specific examples of 
countably infinite sets. 


7.8. Proposition. For all a € Z, Z>a and Z>a and Z<a and Z<a are countably infinite 
sets. 


Proof. We prove the statement about Z>a as an example. Fix a € Z. Writing Z>o = Z>1, 
we need to construct a bijection f : Z>1 > Z>a. Define f(n) =n+a—1 for n € Zs}. Also 
define g : Z>a > Zs, by g(m) = m — (a — 1) for m € Z>a. You can check that f and g map 
into their claimed codomains, and g is a two-sided inverse of f. Hence, f is the required 
bijection. 


7.9. Proposition. Z is countably infinite. 


Proof. We build a bijection f : Z>o > Z as follows. Each n € Zyo is either even or odd, 
but not both. If n is even, define f(n) = n/2, which is in the codomain Z. If n is odd, define 
f(n) = —(n — 1)/2, which is in the codomain Z. To see that f is a bijection, we describe 
its inverse g : Z > Zyo. For m E Z, let g(m) = 2m if m > 0, and let g(m) = —2m + 1 if 
m < 0. The arrow diagram in Figure 7.4 illustrates the action of g. You can prove that g 
maps into Zs and is the two-sided inverse for f. We check one case as an example. Given 
m E€ Z with m < 0, we must show f(g(m)) = m. First, g(m) = —2m + 1, which is odd. So 
f(g(m)) = —(—2m + 1 — 1)/2 = 2m/2 = m, as needed. 
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FIGURE 7.4 
A bijection g from Z>o to Z&o. 
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Countability of Z x Z 


For any set A, recall that A? denotes the set A x A, which consists of all ordered pairs (x, y) 
with x € A and y € A. We now show that the product set Z? and certain related sets are 
countably infinite. Our proof gives surprising applications of material from Chapter 4. 


7.10. Proposition. ZS, and Zo and Z? are countably infinite sets. 


Proof. We first define a bijection h : Z2 > Zso. Let h(a, b) = 2°(2b+ 1) for all a,b € Z>o. 
We prove that h is one-to-one and onto. To see that h is injective, fix a,b,c,d € Z>o with 
h(a,b) = h(c,d), and prove (a,b) = (c,d). We have assumed 2°(2b + 1) = 2°(2d +1). We 
know a >c ora < cor a = c, so consider three cases. 

Case 1. Assume a > c. Rewrite the assumption as 2°7°(2b + 1) = 2d + 1. The left side of 
this equation is an even integer since a — c > 0, but the right side is an odd integer. This 
contradicts the earlier theorem that no integer is both even and odd. 

Case 2. Assume a < c. Here we can write 2b + 1 = 2°-*(2d+1) with c— a > 0. Again there 
is a contradiction because the left side is odd and the right side is even. 

Case 3. Assume a = c. Dividing the assumption by 2° = 2°, we get 2b + 1 = 2d + 1, and 
hence b = d. Thus, (a,b) = (c,d) as needed. 

To see that h is surjective, fix n € Z>o; we must find (a,b) € Z2, with h(a, b) = n. 
Using the formula for h, we need to solve 2°(2b+ 1) = n for a and b. To do so, use the Fun- 
damental Theorem of Arithmetic to write n = 2% p3 <- py, where k > 1, €1,...,e, > 0, 
and p2,...,pẹķ are distinct odd primes. Comparing to the previous expression, we choose 
a =e, and b = (p3? --- p;* — 1)/2 (which is in Z>o) to achieve h(a, b) = n. Note that b = 0 
when k = 1. 

So far, we have proved that |Z%o| = |Zso|. We also know |Z| = |Zso| = |Z>o|, so it follows 
from part (h) of Theorem 7.6 that |Z x Z| = |Zso x Z>oļ = |Zso x Zso| = |Zso|. By transi- 
tivity and symmetry, we conclude that Z?, Z2}, and Z2, are all countably infinite sets. 


We may visualize the map h in the previous proof by labeling each point (a,b) in a 
picture of Z%, with the corresponding positive integer h(a,b), as shown on the left in 
Figure 7.5. Pictures like this one provide powerful intuition for defining bijections on infinite 
sets. For example, the picture on the right in Figure 7.5 suggests another possible bijection 
k: Zo — Zo, obtained by traversing integer points in the first quadrant one diagonal at 
a time. 
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4 A 
9) 18. 36. 72,144 11) 17, 24, 32. 41 
eo o` o “ee e o o` o o 


71 14, 28. 56 112 
eè o o` o o 
54 10, 20. 40. 80 
eo 0o o o o 


3) 6, 12, 24. 48 
eo o o o o 


FIGURE 7.5 
Pictures of two bijections from Zo to Zo. 


The new figure gives compelling visual insight into why there is a bijection from Z2, 
to Zso, but how do we translate the figure into a rigorous proof? One approach is to 
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give a recursive definition of the inverse f : Z>o —> Z%, for k. For the base case, define 
f(1) = (0,0). For the recursive case, fix n € Z„o and suppose f(n) = (a,b) has already 
been defined. Motivated by the figure, we set f(n + 1) = (a + 1,b — 1) if b > 0, and we 
set f(n +1) = (0,a+ 1) if b = 0. Intuitively, the first case moves one step down and right 
along the current diagonal. The second case occurs when n appears at the lower-right end 
of one diagonal, so that n + 1 must be placed at the upper-left end of the next diagonal. 
It can now be proved by induction that f is one-to-one and onto. Remarkably, the map 
k = f~t is given by a simple non-recursive formula: k(a,b) = ((a + b)? + 3a + b)/2 + 1 for 
(a,b) € Zo. You can check that k(f(n)) = n for all n € Zyo by induction on n, and that 


f(k(a,b)) = (a,b) for all a,b € Z&q by induction on a + b, with an inner induction on a. 


Countability of Q and Finite Products 


Now that we know Z x Z is countably infinite, we can show that the set of rational numbers 
is countably infinite. 


7.11. Proposition. Q is countably infinite. 


Proof. By the Schréder-Bernstein Theorem (part (e) of Theorem 7.6), it suffices to show 
|Z>o| < |Q| and |Q| < |Z>olļ. [In the next section, we give another proof of the countability 
of Q not relying on this hard result.] On one hand, the inclusion map i : Z>o — Q given by 
i(n) = n/1 for all n € Zyo is one-to-one, so |Z>o| < |Q| holds by definition. On the other 
hand, define f : Q — Z x Z as follows. Given q € Q, there exist unique integers m and 
n with n > 0, ged(m,n) = 1, and q = m/n; we define f(q) = (m,n). Now f is injective, 
because if q,r € Q satisfy f(q) = (m,n) = f(r), then q = m/n = r. Since Z x Z is known to 
be countably infinite, there is a bijection g : Zx Z > Zyo. Then go f : Q > Z>o is injective, 
being a composition of two injective functions. Therefore |Q| < |Zso|, as needed. 


Next we prove that a product of finitely many countably infinite sets is also countably 
infinite. 


7.12. Theorem on Countability of Products. For all k € Zyo and all sets X,,..., Xx, 
if every X; is countably infinite, then Xı x X2 x --- x X_ is countably infinite. 


Proof. We use induction on k. The base case k = 1 is immediate. Fix k € Zyo, and 
assume the product of any k countably infinite sets is countably infinite. Next, fix k + 1 
countably infinite sets X1,..., Xk, Xk+1, and prove X = X, X++- X Xk X Xk+1 is countably 
infinite. Let Y = Xı x --- x Xk, which is countably infinite by the induction hypothesis. 
Define f : X —> Y x Xp41 by letting f send (z1,..., Zk, 0441) to ((@1,..., 2k), k41) for 
(%1,.--,@n41) E€ X. You can check that f is a bijection, so |X| = |Y x Xx41|. We know 
Y| = |Zso| and |Xx41| = |Zso| since Y and X;,41 are countably infinite. By part (h) of 
Theorem 7.6, we conclude that |Y x X,41| = |Zso x Zso|. Now Z2, is countably infinite, so 
|Z>0 X Zso| = |Zso|. Combining all these cardinal equalities using transitivity, we conclude 
that |X| = |Zso|, as needed. 


7.13. Corollary. For any countably infinite set X and any k € Zo, X* is countably 
infinite. In particular, Zs and Z* and Q* are countably infinite. 


Section Summary 


1. Cardinal Equality. For all sets A and B, |A| = |B| means there exists a bijection 
f : A —> B. Cardinal equality is reflexive, symmetric, and transitive. If | A] = |A’| 
and |B| = |B’|, then |A x B| = |A’ x B’| and |P(A)| = |P(A’)|; if also AN B = 
0 = A’ B’, then |AU B| = |A’U B'|. 
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2. Cardinal Ordering. For all sets A and B, |A| < |B| means there exists an in- 
jection g : A — B. Cardinal ordering is reflexive, antisymmetric, and transitive. 
Antisymmetry (for all A and B, if |A| < |B| and |B| < |A], then |A| = |B|) isa 
hard result called the Schréder—Bernstein Theorem. 

3. Countably Infinite Sets. A set X is countably infinite iff |Z>o| = |X|. The following 
sets are countably infinite: Z, Z>a for any a € Z, Z x Z, Q, products of finitely 
many countably infinite sets, Z*, and Q* (where k € Z>o). 


Exercises 


1. Prove: for all sets X and Y, if |X| = |Y |, then |X| < |Y]. 

2. (a) Prove: for all sets X, Y, and Z, if |X| = |Y| and |Y| < |Z], then |X| < |Z]. 
(b) Prove: for all sets X, Y, and Z, if |X| < |Y| and |Y| = |Z], then |X| < |Z]. 

3. (a) Prove: for all sets X and Y, if X C Y, then |X| < |Y]. (b) Prove or disprove 
the converse of part (a). 


4. (a) Complete the proof of part (h) of Theorem 7.6 by showing h'o h and ho h’ 
are identity maps. (b) Finish the proof of part (h) without using h’, by proving 
that h is one-to-one and onto. 


5. Complete the proof of part (g) of Theorem 7.6 by showing p and p’ are well-defined 
functions such that p'o p and pop’ are identity maps. Where is the assumption 
ANB =Q = A'N B’ needed? 


6. Prove part (i) of Theorem 7.6. 


Prove from the definition that each set below is countably infinite: (a) the set of 
odd positive integers; (b) the set of positive multiples of a fixed k € Z; (c) the set 
of all even integers. 


8. Explain why every countably infinite set actually is infinite (not finite). 
9. Complete the proof of Proposition 7.8. 
10. Complete the proof of Proposition 7.9. 


11. (a) Show that f : ZR — Zso defined by f (a,b,c) = 2%3°5° for a,b,c € Z>o is 
injective. (b) Trace through proofs from this section to find an explicit formula 
for a bijection g : Zło > Zso. 


12. (a) Prove that the map f defined recursively below Figure 7.5 is one-to-one and 
onto. (b) Prove that (a,b) = ((a +b)? + 3a +b)/2+ 1 is the two-sided inverse of 


13. The diagram below depicts yet another bijection p : Z2 — Z. Formally define 
the function p (perhaps recursively), and prove p is a bijection. 


A 
171 18. 19. 20. 21 
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14. Draw a diagram (similar to the one in the previous problem) depicting a bijection 
q:ZxZ-— Z. Define q formally, and prove q is a bijection. 


15. Prove: for all nonempty sets A and B, |A| < |B| iff there exists a surjection 
h: BoA. 

16. Suppose J is any index set, and for each i € I, A; and A’ are sets such that 

A;| = [Ai]: Prove: if for alla A j in I, A4; N Aj = Ø = A; N A}, then |U;ez Ail = 

Vier 4il: 

17. Suppose k € Z>o, and for 1 < i < k, A; and A; are sets such that |A;| = |A;]. 

a) Prove |A; x +- x Ak| = |A} x--- x A} | by constructing a bijection. (b) Prove 

the result in (a) by induction on k, using part (h) of Theorem 7.6. 

18. (a) Prove that the union of two countably infinite sets is countably infinite. 

b) Prove: for all k € Z>o and all sets X1,..., Xx, if every X; is countably 

infinite, then Ui X; is countably infinite. 


19. Suppose that for every n € Z>0, Xn is a given countably infinite set. Prove that 
X = Unez., Xn is countably infinite. [Hint: Use the fact that Z2 o is countably 
infinite.] 


20. Let X be any countably infinite set. (a) Prove there exists f : X —> X that is 
injective but not surjective. (b) Prove there exists g : X — X that is surjective 
but not injective. 
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7.3 Countable Sets 


This section studies countable sets, which are sets that are either finite or countably infinite. 
Our main results are: any subset of a countable set is countable; products of finitely many 
countable sets are countable; and unions of countably many countable sets are countable. 


Countable Sets and Their Subsets 
7.14. Definition: Countable Sets. For all sets X, 


X is countable | iff | X is finite or X is countably infinite. 


We intend to show that any subset of any countable set is countable. First we consider 
the special case of subsets of Zso. 


7.15. Lemma on Subsets of Zo. Every subset of Zs is countable. 


Proof. Fix an arbitrary subset T of Z>o. To prove T is countable, assume T is not finite and 
prove T is countably infinite. We construct a bijective function f : Z>o —> T recursively. 
Intuitively, f(n) = t will mean that t is the nth smallest element of T. To make this precise, 
recall from $6.5 the Well-Ordering Property of Zso, which tells us that every nonempty 
subset of Z>o has a least element. Since T is certainly nonempty, we may define f(1) to be 
the least element of T. Now fix n € Zso, and assume that f(1), f(2),..., f(n) have already 
been defined. We also assume f(1),..., f(n) are pairwise distinct members of T. Note that 
Tn =T —{f(1), f(2),..., f(n)} is a subset of Zo that is nonempty. For if T, were empty, 
then T C {f(1), f(2),..., f(m)}, so that T would be finite, contrary to our assumption. We 
define f(n + 1) to be the least element of Tn, which is a member of T different from f(7) 
for all i between 1 and n. This completes the recursive definition of f. Figure 7.6 illustrates 
the definition of f for the set T = {3,6,10,11,13,...}. 


iii 
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FIGURE 7.6 
Example of the bijection f : Z>o > T. 


To show that f is injective, fix m,p € Z>o, assume m Æ p, and prove f(m) Æ f(p). 
We know m < p or p < m. In the case where m < p, f(p) # f(m) since f(m) is in the 
set {f(1),...,f(p —1)}, but f(p) is not in this set by definition. In the case where p < m, 
f(p) 4 f(m) since f(p) is in the set {f(1),..., f(m — 1)}, but f(m) is not in this set. 

To show that f is surjective, we first prove that f(n) > n for all n € Zso. We use 
induction on n. For the base case, f(1) is a member of T, so f(1) € Z>o, so f(1) > 1. Now 
fix n € Zo, assume f(n) > n, and prove f(n+1) >n+1. Assume, to get a contradiction, 
that f(n+1) <n+1. Since f(n+1) is an integer, it follows that f(n+1) < n < f(n). Now, 
f(n +1) is the least element of T — {f(1), f(2),...,f(n)}, so f(n +1) 4 f(n). We deduce 
that f(n +1) < f(n). But now f(n + 1) is an element of the set T — {f(1),...,f(n — 1)} 
strictly smaller than f(n), which contradicts the definition of f(n) as the least element of 
this set. So f(n +1) > n+ 1, completing the induction. 
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Now we prove f is surjective. Fix t € T; we must show there exists n € Zso with 
f(n) = t. We have shown that f(t) > t. If f(t) = t, we may choose n = t. If f(t) >t, then 
the least element of the set S = T — {f(1), f(2),..., f(t — 1)} (which is f(t) by definition) 
is larger than t. It follows that t cannot belong to S. But t does belong to T, which forces 
t = f(n) for some n € {1,2,...,t — 1}. Thus, f is surjective. 


7.16. Theorem on Countable Sets. For all sets X and Y: 
(a) If |X| = |Y |, then X is countable iff Y is countable. 

(b) |X| < |Y| iff IW, W CY and |X| = |W]. 

(c) X is countable iff |X| < |Zsol. 

(d) If |X| < |Y| and Y is countable, then X is countable. 

(e) If X CY and Y is countable, then X is countable. 


Proof. We prove parts (b) through (e), leaving part (a) as an exercise. For part (b), first 
assume |X| < |Y|, which means there is an injection f : X — Y. Choose W = f[X] to be 
the image of f, which is a subset of Y. Let f’: X — W be obtained from f by restricting the 
codomain from Y to W. Then f’ is injective since f is, and f’ is surjective by construction. 
So |X| = |W|. Conversely, assume |X| = |W| for some W C Y. This means there is a 
bijection g : X — W. Composing g with the injective inclusion map i: W — Y, we obtain 
an injection io g : X — Y, which proves that |X| < |Y]. 

For part (c), first assume X is countable. In the case where X is finite, we know there 
exists n E€ Z>o with |X| = |{1,2,...,n}]. Since {1,2,...,n} is a subset of Z>o, part (b) 
applies to show that |X| < |Zso|. Similarly, in the case where X is countably infinite, we 
know |X| = |Zso|. Since Zso is a subset of itself, part (b) applies to show that |X| < |Zsol. 
Conversely, assume |X| < |Zso|. By part (b), there exists W C Zso with |X| = |W]. By 
Lemma 7.15, W is countable. By part (a), X is also countable. 

For part (d), assume |X| < |Y| and Y is countable. Then |Y| < |Zso| by part (c), so 
|X| < |Zso| by transitivity, so X is countable by part (c). For part (e), assume X C Y and 
Y is countable. The inclusion i : X — Y is injective, so |X| < |Y|. Thus, X is countable by 
part (d). 


Using this theorem, we can see that Q is countably infinite without invoking the 
Schréder—Bernstein Theorem. Recall that we showed |Q| < |Z x Z| in the proof of Proposi- 
tion 7.11. Since |Z x Z| is known to be countably infinite and hence countable, we conclude 
that Q is countable by part (d) of Theorem 7.16. Now Q is not finite (since it contains the 
infinite subset Z.9), so Q is countably infinite. 


Products of Countable Sets 


We have seen that the product of finitely many finite sets is finite, and the product of finitely 
many countably infinite sets is countably infinite. We now establish the corresponding result 
for countable sets. 


7.17. Theorem on Products of Countable Sets. For all positive integers k and all sets 
X1,...,XK,V1,..-, Ve: 

(a) If |X;| < |Y;| for 1 < i < k, then |X, x Xə x -+ x Xg] < |Y1 x Yo x <- X Ygl. 

(b) If every X; is countable, then X1 x X2 x --- x Xp is countable. 


Proof. Fix k € Zyo and sets X;, Y; for 1 < i < k. Let X = X1x- - -x Xp and Y = Y1 x- -X Yp. 
For part (a), assume |X;| < |Y;| for all 7, which means there exist injections f; : X; — Y; for 
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all i. We must build an injection f : X — Y. Define f(x1,..., £k) = (fi(@1),.--, fk(£k)) for 
(a1,...,%,%) E X. To see that f is injective, fix z = (£1,..., 8p) E X, a’ = (v,...,24,) E 
X, assume f(x) = f(a’), and prove x = x’. We have assumed (f1(%1),.-., fkl£k)) = 
(Filzi) fr(x,)), so filz) = fi(at) for 1 <i < k. Since each f; is injective, we conclude 
that x; = x; for 1 < i < k, which means x = 2’. 

For part (b), assume every X; is countable. By part (c) of Theorem 7.16, |X;| < |Zso 
for 1 < i < k. By what we proved in the last paragraph, |X| = |X1x---* X| < |Zso X- x 
Zso| = |ZEo| = |Zso|, so |X| < |Zso|. By part (c) of Theorem 7.16, X is countable. 


Unions of Countable Sets 


We have seen that the union of finitely many finite sets is finite. Now we can show that the 
union of countably many countable sets is countable. 


7.18. Theorem on Unions of Countable Sets. 

(a) Suppose Xx is a countable set for each k € Zo. Then Up, Xx is countable. 

(b) For all n € Zyo and all sets Y1, . . . , Yn, if every Y; is countable, then U;_, Yx is countable. 
Proof. We first prove part (a) under the additional assumption that the sets X;, are pairwise 
disjoint and countable. We know |Xx| < |Zso| for each k, so there exist injections fy : X; > 
Z>o for each k € Zyo. Let X =U, Xx, and define f : X > Z2, as follows. Given x € X, 
there exists a unique k € Zso with x € Xx, because the sets X_ are pairwise disjoint. Define 


f(x) = (k, fr(x)) € Zo. Figure 7.7 illustrates how f sends each set X;, into the kth vertical 
slice of Z2,. In this figure, |X| = 4, |X3| = 3, and Xj and X4 are countably infinite. 


fIX,] 1X4] 


FIGURE 7.7 
Illustration of the map f : UP, Xp > Zo. 


We now show that f is injective. Fix x,y € X, assume f(x) = f(y), and prove x = y. 
We have f(x) = (k, fx(x)) and f(y) = (j, fi(y)) for some j,k € Zo. Since f(x) = f(y), 
we have k = j and fx(x) = f;(y) = fr(y). Because fp is injective, x = y follows. Because 
f is one-to-one and Z2 is countably infinite, we conclude that |X| < |Z2o| = |Zso|. So 
|X| < |Zso|, hence X is countable by part (c) of Theorem 7.16. 

Now we prove the general version of part (a). Assume each X; (for k € Zyo) is a 
countable set. Define Xi = X, and, for each k > 1, X; = Xk — ee Xi. For each k > 0, 
Xj, C Xk, so Xj, is countable. You can check that Ue, Xs = UZ2, X;, and that the sets 
Xj, are pairwise disjoint. Since the union of the sets Xj, is countable by the special case of 
(a) already proved, we conclude that the union of the sets X;, is countable, as needed. 
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Part (b) follows from part (a) by taking X, = Yp for 1 < k < n, and X; = 9 for all 
k>n. 


Section Summary 


1. Countable Sets. A set X is countable iff X is finite or countably infinite. X is 
countable iff |X| < |Zso| iff there is a bijection from X onto a subset of Zo. If 
|X| < |Y| and Y is countable, then X is countable. 


2. Operations on Countable Sets. Any subset of a countable set is countable. A 
product of finitely many countable sets is countable. A union of countably many 
countable sets is countable. 


Exercises 


1. Prove part (a) of Theorem 7.16. 


2. Complete the proof of Theorem 7.18 by showing that the sets Xj, are pairwise 
disjoint, and UZ] Xk =U, X. 

3. (a) Prove: for all sets X and Y, if X is countable, then X — Y is countable. 
(b) Prove: for all sets X and Y, if X is countably infinite and X MY is finite, 
then X — Y is countably infinite. 


4. Give a specific example of countably infinite sets X and Y such that Y C X and 
X — Y is countably infinite. 


5. (a) Give a specific example of pairwise disjoint countably infinite sets Xp (for 
each k € Z>o) such that Z = UP, Xx. (b) Give a specific example of pairwise 
disjoint countably infinite sets Y, (for each k € Z>o) such that Q = UR, Yx- 


6. A finite sequence with values in a set X is a function f : {1,2,...,k} —> X for 
some k € Zs. Show: for all countable sets X, the set of all finite sequences with 
values in X is countably infinite. 


7. Show: for all countable sets X, the set of all finite subsets of X is countable. 


8. Let T be an infinite subset of Zo. In the proof of Lemma 7.15, we constructed 
a bijection f : Z>o + T. Prove that the function g : T > Zso, given by g(t) = 
[TA {1,2,...,¢}| for t € T, is the two-sided inverse of f. 


9. (a) Suppose we try to prove that every subset of Z is countable by replacing Zs 
by Z everywhere in the proof of Lemma 7.15. Explain precisely why the proof 
does not work. 

(b) Repeat part (a), but now replace Z>o by Rso. 


10. Let W be a countably infinite well-ordered set (see §6.5), and let T be an infinite 
subset of W. (a) Show that the recursive construction of f : Z>o —> T in the proof 
of Lemma 7.15 extends to this situation and produces a well-defined injective 
function f. (b) The proof that f is surjective does not extend to this setting. 
Explain the exact point in the proof that fails. (c) Consider the specific example 
W = {0,1} x Zyo, ordered lexicographically (see Exercise 12 in §6.5). Prove that 
the function f in part (a) is surjective iff T N ({1} x Zso) = 0. 

11. The proof that a union of countably many countable sets is countable requires 
the Axiom of Choice. Find the exact point in the proof where this axiom is used. 
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7.4 Uncountable Sets 


One of the unintuitive aspects of cardinality theory is that there are many different “sizes” 
of infinite sets. We have already studied countably infinite sets, which are the “smallest” 
infinite sets. In this section, we produce some concrete examples of uncountably infinite 
sets, including: the set R of real numbers; any open interval (a,b) or closed interval [a,b] 
with a < b; the set P(Z>o) of all subsets of Z>o; and the set of infinite sequences of zeroes 
and ones. We also prove Cantor’s Theorem, which says that P(X) always has strictly larger 
cardinality than X, and the Schroder—Bernstein Theorem, which asserts the antisymmetry 
of the relation |X| < |Y]. 


Definition of Uncountable Sets 


7.19. Definition: Uncountable Sets. For all sets X, X is uncountable iff X is not 
countable. 


The following theorem is readily proved, using Theorem 7.16. 


7.20. Theorem on Uncountable Sets. For all sets X and Y: 
(a) If |X| = |Y |, then X is uncountable iff Y is uncountable. 

(b) If |X| < |Y| and X is uncountable, then Y is uncountable. 
( 


c) If Y has an uncountable subset, then Y is uncountable. 


Uncountable Sets of Sequences 


To prove that an infinite set X is countable, it suffices to construct one particular bijection 
from Zs to X. On the other hand, proving the uncountability of X is harder, since we 
must show that every function from Zs to X fails to be bijective. To provide a specific 
example of an uncountable set, we need the notion of an infinite sequence. 


7.21. Definition: Infinite Sequences. For any set X, an infinite sequence with values in 
X is a function f : Z>o > X. When thinking of f as a sequence, we often write fn instead 
of f(n) and call fn the nth term of the sequence. 


7.22. Theorem on Uncountable Sets of Sequences. For any set X with more than 
one element, the set S of all infinite sequences with values in X is uncountable. 


Proof. Fix a set X with more than one element, and fix two elements a 4 b in X. To prove 
S is not countable, we must show S is not finite and S is not countably infinite. You can 
prove that S is not finite by exhibiting an injection from the infinite set Zo into S (see 
Exercise 5). To show that S is not countably infinite, we must prove there does not exist a 
bijection from Z>o onto S. Our strategy is to prove that every function g : Z>o —> S is not 
surjective (and hence not bijective). 

Fix a function g : Zs) > S. Surjectivity of g means Vf € S,dm € Zso,g(m) = f. 
Negating this, we must prove If € S,Vm € Zso,g(m) Æ f. Note that for each m > 0, g(m) 
is an infinite sequence, i.e., a function from Zs to X. The nth term of the sequence g(m) 
is g(m)(n), which we abbreviate as g(m)n Or gm,n. We can visualize g itself as an infinite 
table, where row m of the table contains the terms of the sequence g(m), as shown here: 
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n=1 n=2 n=3 n=4 n=5 
g(1) 91,1 91,2 91,3 91,4 91,5 
g(2): 92,1 92,2 92,3 92,4 92,5 
g(3): 93,1 93,2 93,3 93,4 93,5 
g(A): 
(5) 


94,1 94,2 94,3 94,4 94,5 
95,1 95,2 95,3 95,4 95,5 


Our goal is to build a sequence f that is unequal to every sequence in this table. For 
each n € Zyo, define fn = b if gx, = a, and define fn = a if gnn A a. Fix an arbitrary 
m € Z>o. We must prove g(m) Æ f. To prove that the sequence g(m) is unequal to the 
sequence f, it is enough to find one position where these sequences disagree (by definition 
of equality of functions). By construction of f, we have fm Æ 9mm = 9(™)m, so that the 
sequence f and the sequence g(m) disagree at position m. Thus g(m) 4 f, as needed. 


Here is an example illustrating the preceding proof. Suppose X = {1,2,3}, and the table 
for g looks like this: 


n=1 n=2 n=3 n=4 n=5 
g(1): | [1] 1 1 1 1 
g(2):| 2 2 2 2 2 
g(3): 3 3 3 3 3 
g(4):| 1 2 1 2 1 
g(5): 3 1 2 2 1 


To build a sequence f not appearing in this table, we read down the main diagonal of 
the table and change every entry. Taking a = 1 and b = 2, the sequence f for this g has 
first five terms fı = 2, f2 = 1, fs = 1, f4 = 1, and fs = 2. We see that f 4 g(1) since these 
sequences disagree in position 1; f # g(2) since these sequences disagree in position 2; and 
so on. This construction is called Cantor’s diagonal argument. 


Uncountable Sets of Real Numbers 


Now that we have some uncountable sets available, we can use bijections and injections to 
show that certain subsets of R are uncountable. 


7.23. Theorem on Uncountable Subsets of R. The following sets are uncountable: the 
set R of real numbers; the open interval (0,1); the open interval (a,b) for all a < b in R; 
and the closed interval [a,b] for alla < bin R. 


Proof. First we show (0,1) is uncountable. We need the following fact about decimal repre- 
sentations of real numbers, which we state without proof: for every real number r € (0, 1), 
there exists exactly one infinite sequence f with values in {0, 1, 2, 3, 4, 5,6,7,8,9} such that 
r=, f(k)10-*, and ~4k, Ym > k, f(m) = 9. Informally, f(k) is the kth digit after the 
decimal point in the decimal expansion of r. To ensure uniqueness, we forbid expansions 
that end in an infinite string of 9s. For example, taking r = 1/3, the sequence f has f(k) = 3 
for all k € Z>o. Taking r = 1/2 = 0.5000 - - - = 0.4999 - - - , the sequence f has f(1) = 5 and 
f(k) =0 for all k > 1. 

Now let S be the set of all infinite sequences with values in X = {4,7}. We already know 
S is uncountable. Define G : S —> (0,1) by setting G(f) = Yg] f(k)10-* for f € S. Using 
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calculus, you can check that this infinite series does converge to some number in (0,1). The 
fact cited above ensures that G is injective, so |S| < |(0,1)|. By part (b) of Theorem 7.20, 
(0,1) is uncountable. 

Next, fix a,b € R with a < b. The map h: (0,1) — (a,b), defined by h(t) = a+ (b — a)t 
for 0 < t < 1, is readily seen to be a bijection. Thus |(a,b)| = |(0,1)|, so the open interval 
(a,b) is uncountable. It follows that [a,b] and R itself are uncountable, since the uncountable 
set (a,b) is a subset of each of these sets. 


We now present another proof that closed intervals in R are uncountable. This proof does 
not rely on the existence and uniqueness of decimal expansions of real numbers. Instead, we 
need a much more elementary fact called the Nested Interval Property. This property says 
that for any sequence of nonempty closed intervals |an, bn] such that [an+1, bn+1] C [an, bn] 
for all n € Zso, the intersection X] fan, bn] is nonempty. See §8.5 for a proof of this 
property. 


7.24. Theorem on Uncountability of Closed Intervals. For all real a < b, [a,b] is 
uncountable. 


Proof. Fix real a < b. We fix an arbitrary function (or sequence) f : Zso — [a,b] 
and prove that f cannot be surjective. We recursively construct a sequence of nested 
intervals [an,bn] for n € Z>o with the property that [an,b,] C [a,b], an < bn, and 
lan, On] N {f(1), f(2),..., f(n)} = Ø for all n > 0. To begin, let ag = a and bọ = b. 
Now, fix n € Z>o and assume ao, @1,...,@n, bo, 01,...,6n have already been chosen in 
a way that satisfies the property above. Let d = bn — an > 0 be the length of the inter- 
val [an, bn], and consider the three subintervals [an,an + d/3], [an + d/3, an + 2d/3], and 
[an + 2d/3, bn]. These three intervals are subsets of [an, bn] and hence do not contain any 
of the real numbers f(1),..., f(n). Furthermore, at most two of the three subintervals can 
contain f(n + 1). [It is possible for f(n +1) to belong to two of the intervals; this happens 
when f(n+1) = a,+d/3 or f(n+1) = an +2d/3.] Choose [an41, bn+1] to be the first of the 
three intervals that does not contain f(n +1). Then [an+1,bn+1] © lan, bn], aAn+1 < bn41, 
and [an41,0n4i] N{f(1),.--,f(n), f(n + 1)} = 0. By recursion, this defines [an, bn] for all 
n € Z>o. To finish, invoke the Nested Interval Property to see that there must exist a 
real number r in the intersection of all the intervals [an, bn]. For fixed n € Zso, we know 
r € jan, bn] and hence r 4 f(n) by construction. Since n was arbitrary, we see that r is a 
number in fa, b] that is outside the image of f. Thus, f is not surjective and so not bijective. 
So [a,b] must be uncountable. 


Cantor’s Theorem 


We now prove the remarkable fact that for every set X, the power set P(X) (the set of all 
subsets of X) has strictly larger cardinality than X. 


7.25. Cantor’s Theorem. For any set X, |X| < |P(X)]. 


Proof. Fix a set X. We must show |X| < |P(X)| and |X| #4 |P(X)|. On one hand, the 
map k: X + P(X) given by k(x) = {x} for x € X is readily seen to be an injection. So 
|X| < |P(X)|. On the other hand, to show |X| 4 |P(X)|, we must prove that every function 
f: X — P(X) is non-bijective. Fix such a function f; it suffices to prove f is non-surjective. 
So, we must show 4S € P(X), Va € X, S # f(x). 

The rest of the proof is a variation of Cantor’s diagonal argument for sequences. We 
choose S = {z € X : z ¢ f(z)}, which is a subset of X and hence a member of P(X). 
We prove Yx € X, S 4 f(x) by contradiction. Assume, to get a contradiction, that dao € 
X, S = f(a). Let us ask whether the object z = x is in S. By definition of S and the fact 
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that S = f(xo), vo € S iff ao Z f (xo) iff zo ¢ S. We now have reached the contradiction 
“to E SS To g S.” 


Taking X = Z,„o in this theorem, we see that P(Zs9) is uncountable. As another 
example, just as the uncountable set R is a larger infinite set than Z or Q, we see from 
Cantor’s Theorem that P(R) is an even larger set than R. Iterating the theorem, we get a 
chain of uncountable sets, each strictly larger in cardinality than the previous one: 


IR| < [P(R)| < |P(P(R))| < [P(P(P(R)))| < -+ 


The mind-boggling tower of infinities never ends; for if we tentatively assumed there were 
a largest set X, Cantor’s Theorem would provide an even larger set P(X). 


Proof of the Schréder—Bernstein Theorem (Optional) 


We conclude our introduction to cardinality by proving the Schréder—Bernstein Theorem. 
We begin with a special case of the main result. 


7.26. Lemma. For all sets X and W, if W C X and there exists an injective function 
f: X — W, then there exists a bijective function h : X > W. 


Proof. Fix arbitrary sets X and W. Assume W C X and f : X — W is a one-to-one 
function. When X is infinite, the function f may not be onto. The following ingenious 
construction uses f to build a bijection h : X — W. Recursively define a sequence of sets 


Ao > Ay > Ag Drs > An > Any Diets 


by letting Ao = X, Ai = W and An42 = f[An] for all n E€ Z>o. We check by induction on 
n that Anyi C Ap for all n € Z>o. This is true when n = 0 because W C X by assumption. 
This is true when n = 1 because Ag = f[X], A, = W, and we assumed f maps X into 
the codomain W. Now fix n € Z>2, and assume that An C An—1 C An-2 C ++: C Ao 
is already known. We must prove Ani C An. Since An_1 C An_—2, we conclude that 
f[An-1] © f[An—2], so An+1 C An as needed. 

The next step is to define Bn = An — An+i for all n € Z>ọ and B = NÇ o An. It is 
routine to check that the sets Bo, B,,...,Bn,...,B are pairwise disjoint and have union 
X, whereas the sets B1, Bo,...,Bn,...,B are pairwise disjoint and have union W. See 
Figure 7.8, where X is the entire box, and W is the part of the box below the thick line. 

A crucial observation is that f[B,] = Bn+2 for all n € Zso. To verify this, note that 
injectivity of f implies f[A — C] = f[A] — f[C] for all A,C C X (see Exercise 14 in §5.8). 
So for each n € Zo, 


fIBn] = f[An = An+1] = FIAn] = f[An+1] = An+2 = An+3 = Bye. 


Next, using injectivity of f, f[B] = B follows from the computation 


(An = (N Fln] = [ An = () 4; =B. 
n=0 n=0 j=2 


n=0 
(The last equality uses the fact that Ag C A; C Ag.) 
Since f is injective, all the restrictions f|g, and f|g are also injective. Since f[B] = B 
and f[Bn] = Bn+2 for all n € Zso, we can restrict the codomains of f|g,, and f|g to get 


f[B] = f 
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FIGURE 7.8 
Visualizing the proof of the Schröder-Bernstein Theorem. 


bijections fn : Bn > Bn+2 and f : B — B. Now comes the final miraculous idea: define a 
function h by 
fr() if x € Bn where n € Z>o is even; 
h(x) = x if x E€ By where m € Zo is odd; 
f.e(z) ife@e B. 


As shown on the left edge of the box in Figure 7.8, h is defined by gluing together the 
bijections fn : Bn —> Bn+2 for n even, Idg,, : Bm —> Bm for m odd, and fẹ : B —> 
B. The domains of these bijections are the pairwise disjoint sets Bo, B1, Bo, Bs, B4,..., B 
whose union is X; and the codomains of these bijections are the pairwise disjoint sets 
Bo, Bı, Bs, B3, Be,..., B whose union is W. We see at once that h is a well-defined function 
mapping the domain X onto the codomain W, and h is one-to-one. So we have constructed 
the required bijection h : X > W. 


7.27. Schréder—Bernstein Theorem. For all sets X and Y, if |X| < |Y| and |Y| < |X 
then |X| = |Y]. 


Proof. Fix sets X and Y. Assume |X| < |Y| and |Y| < |X|, which means there are injections 
F:X >Y andG:Y —> X. We must prove |X| = |Y |, which means there is a bijection H : 
X > Y. Define W = G[Y], which is a subset of X. Define f : X > W by f(x) = G(F(x)) 
for all x € X. Note that f is injective, since it is obtained from the injective function 
Go F: X —> X by shrinking the codomain to W. Similarly, if we shrink the codomain of 
the injective function G to W = G[Y], we obtain a bijective function G; : Y — W. Since 
W CX and f : X — W is injective, Lemma 7.26 shows that there is a bijection h : X > W. 
Then H = Gr oh: X > Y is a bijection, since it is the composition of the bijections h 
and GT}. 


7.28. Corollary: No-Universe Theorem. There is no set of all sets. 


Proof. Assume, to get a contradiction, that X is a set such that for all sets Y, Y € X. 
Consider the power set P(X) consisting of all subsets of X. Every Y € P(X) is a set, so 
every such Y is in X by hypothesis, so P(X) C X. This implies |P(X)| < |X|, and we 
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saw earlier that |X| < |P(X)|. Now |X| = |P(X)| follows from the Schréder—Bernstein 
Theorem; but this contradicts Cantor’s Theorem. 


Section Summary 


1. Uncountable Sets. A set X is uncountable iff X is not countable. To prove that 
an infinite set is uncountable, it suffices to prove that every f : Z>o > X is not 
surjective. If |X| = |Y |, then X is uncountable iff Y is uncountable. If |X| < |Y| 
and X is uncountable, then Y is uncountable. If Y has an uncountable subset, 
then Y is uncountable. 


2. Examples of Uncountable Sets. R is uncountable. For all a < b in R, the intervals 
(a,b) and [a,b] are uncountable. The set P(Zs) is uncountable. If X has more 
than one element, the set of infinite sequences with values in X is uncountable. 


3. Cantor’s Theorem. For every set X, |X| < |P(X)|. The proof uses the diagonal 
method, starting with any given function f : X — P(X) and constructing a 
subset of X not in the image of f. One consequence of Cantor’s Theorem is that 
there is no set of all sets. 


4. Schréder-Bernstein Theorem. For all sets X and Y, if |X| < |Y| and |Y| < |X], 
then |X| = |Y |. The proof combines injections F : X > Y andG: Y > X to 
build a bijection H : X > Y. When Y C X and G : Y —> X is the inclusion map, 
the construction of H is visualized in Figure 7.8. 


Exercises 


1. For fixed a < b in R, show that h : (0,1) > (a,b), given by h(t) = a + (b — a)t for 
t € (0,1), is a bijection. 


2. Given that the interval [0, 1] is uncountable, prove that every open interval (a, b) 
is uncountable without using Theorem 7.23. 


3. Show that f : (-1,1) > R, given by f(x) = x/(1 — x?) for x € (-1,1), is a 
bijection. 
4. Prove Theorem 7.20. 


5. (a) Complete the proof of Theorem 7.22 by constructing an injection g : Z>o > S. 
(b) The proof shows that the map g in part (a) cannot be surjective. Find the 
specific sequence f constructed by the proof that is outside the image of g. 


6. Use calculus to show that the series X272; f(k)10~* in the proof of Theorem 7.23 
converges to a real number in (0,1). [Hint: Compare to the geometric series 
Doi (4/10)* and 39,55 (7/10)*.] 

7. (a) What goes wrong with the proof of Theorem 7.23 if we use X = {0,4} instead 
of X = {4,7}? (b) What goes wrong with the proof if we use X = {7,9} instead 
of X = {4,7}? 


8. Let X be a fixed countably infinite set, and let Y be a fixed uncountable set. 
Decide (with proof) whether each set of functions is countable or uncountable. 
(a) the set of all f : {1,2,3} > X 
(b) the set of all g : {1,2,3} > Y 
(c) the set of all h : X —> {1,2,3} 
(d) the set of all k : Y > {1, 2,3} 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
19. 


20. 


21. 


22. 
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In the proof of Lemma 7.26, check that the sets Bo, Bi,...,By,..., B are pairwise 
disjoint, the union of these sets is X, and the union of these sets excluding Bo 
is W. 

(a) Prove that for any injective function f : X — Y and all sets A, C X, 
FIN 9 An) = Nee f[An]- (b) In the proof of Lemma 7.26, explain in detail why 


Let W = {2k : k € Z}, and define an injection f : Z > W by f(n) = 4n for all 
n € Z. In the proof of Lemma 7.26, describe the sets An, Bn, B, and the bijection 
h:Z> W. 


Define an injection f : Z>9 > Zs, by f(n) =n +3 for all n € Z>o. In the proof 
of Lemma 7.26, describe the sets An, Bn, B, and the bijection h : Z>9 > Z>1. 


In Lemma 7.26, suppose f : X — W happens to be a bijection. Describe the sets 
An, Bn, B and the bijection h : X — W in this case. 


In Lemma 7.26, suppose W = X. Describe the sets An, Bn, B, and the bijection 
h: X — W in this case. 


Define F,G : Z>o > Zyo by F(n) = 2n and G(n) = 2n — 1 for n € Zyo. Trace 


through the proof of the Schréder—Bernstein Theorem to construct a bijection 
H : Z>0 > Zso from F and G. 


For any sets X and Y, let Y* be the set of all functions f : X > Y. Prove: if 
|X| =k and |Y| = n, then |Y*| = në. 

Use a bijection to prove: for all sets X, Y, Z, if X NY = Ý then |Z*YY| = 
ee XZY 

Use a bijection to prove: for all sets X, Y, Z, |(ZY)*| = |Z***|. 

Given sets X and Y, a function f : X — Y, and an injection g : Y > X, show 
there exist sets A, B, C, D such that {A, B} is a set partition of X, {C, D} is a set 
partition of Y, C = f[A], and B = g[D]. (Hint: Let Z = X — g[Y] and h= go f; 
then define A to be the intersection of all sets U C X with ZU A[U] C U.) 


Use the previous exercise to give another proof of the Schroder—Bernstein Theo- 
rem. 

A set S C Zyo is closed under multiplication iff Va,b E€ S,a -b E€ S. Let X be the 
set of all subsets of Zo that are closed under multiplication. Is X countable or 
uncountable? Prove your answer. 


A set S C Z>o is closed under addition iff Va,b E€ S,a +b € S. Let Y be the set of 
all subsets of Zo that are closed under addition. Is Y countable or uncountable? 
Prove your answer. 


Review of Functions, Relations, and 
Cardinality 


Tables 7.1-7.6 on the following pages review the main definitions and theorems covered in 
the preceding chapters on functions, relations, and cardinality. All unquantified variables in 
the tables represent arbitrary objects, sets, functions, or relations (as required by context). 


Functions and Relations 


1. Arrow Diagrams. We can visualize a relation R from X to Y by drawing an arrow 
from a dot labeled x to a dot labeled y for each (x,y) € R. The relation is the 
graph of a function f : X — Y iff each x € X has exactly one arrow leaving 
it. Such a function f is injective iff each y € Y has at most one arrow entering 
it; f is surjective iff each y € Y has at least one arrow entering it; and f is 
bijective iff each y € Y has exactly one arrow entering it. The relation R7! is 
found by reversing all arrows. We get the arrow diagram of a composition So R 
by concatenating R-arrows followed by S-arrows. 


2. Cartesian Graphs. We can visualize a relation R on R by the graph consisting 
of all points (x,y) in the plane with (x,y) € R. The relation is the graph of a 
function f : X — Y iff the points drawn all lie within the rectangle X x Y, and 
for each xp € X the vertical line x = xo intersects the graph in exactly one point. 
Such a function f is injective (resp. surjective, bijective) iff for all yo € Y, the 
horizontal line y = yo intersects the graph in at most one (resp. at least one, 
exactly one) point. The relation R~! is found by reflecting the graph in the line 
y=. 

3. Digraphs. We can visualize a relation R on X by drawing a single copy of X with 
arrows from x to y for each (x,y) € R. R is reflexive on X iff there is a loop at 
each vertex x € X; R is symmetric iff every arrow from x to y is accompanied by 
the reverse arrow from y to x; and R is transitive iff whenever we can go from x 
to y to z by following two arrows, the arrow directly from x to z is also present. 


4. Proving Functions are Well-Defined. We often define functions g : A > B by 
giving a formula specifying g(a) for each x € A. To be sure we do have a well- 
defined function, we must check three things: g(x) is in the codomain B for all 
x € A; for each x € A there exists at least one associated output g(x); and for all 
1,02 € A, if £1 = x2, then g(x1) = g(x2). The last condition can be restated: for 
all x,y,z, if (x,y) and (a, z) belong to the graph of g, then y = z. It is especially 
crucial to prove this single-valuedness condition when members of the domain A 
have multiple names (e.g., if A is a set of equivalence classes). If f(a) is computed 
using one of the names of x, we must verify that changing the name of the input 
x does not change the value of the output f(x) (although the name of the output 
could change). 


5. Proving Function Equality. To prove that functions f and g are equal, first check 
that f and g share the same domain (say X) and the same codomain (say Y). 
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Function Definitions. Here, f is a function with domain A and codomain B. 


Concept 
function [formal version] 


f:A>B 


graph of f: A> B 

f is injective (one-to-one) 
f is surjective (onto) 

f is bijective 

direct image f[C] 
preimage f~'[D] 
function equality 


identity function Id4 
constant function c 
inclusion map j4,B 
pointwise sum f +h 
pointwise product f-h 
composition go f 


inverse of f: A> B 

right inverse of f 

left inverse of f 

invertible function 
restriction f|s (for S C A) 
characteristic function Xs 
h = glue(h; : 7 € I) 


fiber of f over y 


Definition 
ordered triple f = (A, B,G) where A and B are sets, 
G C Ax B, and Yz € A, a!y, (x,y) E€ G. 
A is the domain of f, B is the codomain of f, and 
G is the graph of f; y = f(x) means (x,y) € G. 
f is a function with domain A and codomain B, meaning: 
Vz, Yy, y = f(w) = (£ € AAy € B), and 
vre A,3!y,y = f(z). 
G={(a, f(a) : z € A}. 
Va E€ A, Yc € A, f(a) = fl) >a =c. 
Vy € B, Jx € A,y = f(x). 
f is one-to-one and f is onto; i.e., Vy € B,3!x € A, y = f(x). 
y € f[C] if 3x € C, y = f(a). 
x € f [D] iff x € A and f(z) € D. 
For f: A> Bandg: A > B’, 
f= g iff A=A’ and B= B' and Vz € A, f(x) = g(x). 
Ida : A — A where Id4 (x) = z for all x € A. 
f: A— B where f(x) = c for all x € A (c € B is fixed). 
j: A—> B where j(x) = x for all x € A. 
(f +h)(x) = f(x) + h(x) for « € A (codomain is R). 
(f -h)(x) = f(x)h(x) for x € A (codomain is R). 
For f: A> B and g: B >C, 
go f: A —> C satisfies (g o f)(x) = g(f(x)) for x € A. 
fT: B —> A where Vz € A, Vy € B,y = f (£) & x = ft (y). 
g: B— A with fog = Ids. 
h: B —> A with ho f =Ida. 
function that has a (two-sided) inverse. 
f\ls : S > B where f|s(x) = f(x) for x € S. 
xs(x) = 1 for x E€ S, xs(x) = 0 fora ZS. 
For hi : A; > Bi with graphs H; agreeing on overlaps, 
h: User Ai > Ucr B; has graph Ucr Hi. 
So h(x) = hi(x) for all x € Aj. 
F HUH = {ee A: fe) = yh. 


LI 
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TABLE 7.2 
Relation Definitions. 


Concept 

R is a relation from X to Y 
R is a relation on X 

infix notation aRb 

image R[C] 

identity relation on X 
inverse of relation R 
composition of R and S$ 
R is reflexive on X 

R is symmetric 

R is transitive 

R is antisymmetric 

R is irreflexive on X 
equivalence relation on X 
equivalence class of a 
quotient set X/R 
equivalence relation ~ 
congruence modulo n 
integers modulo n 


addition modulo n 
multiplication modulo n 
set partition of X 


eqv. rel. ~p of set ptn. P 
partial order on X 

partially ordered set (poset) 
total order on X 

z is greatest element of S 

z is least element of S 
well-ordering on X 


TABLE 7.3 
Cardinality Definitions. 


Concept 

|x| = |¥| 

Ix| < iY] 

X has size n (n € Z>0). 
X is finite. 7 
X is infinite. 

X is countably infinite. 
X is countable. 

X is uncountable. 


Definition 

RCXxY. 

RCXxX. 

aRb iff (a,b) € R. 

y € R|C] iff 3x € C, (x,y) E€ R. 

(a, p a 

(y, z) € ff (x,y) € R. 

(a, eer (z,y)ERA(y,2z)€S. 

Ya E€ X,(a,a) € R. 

Va, Vb, (a,b) E R> (b,a) ER. 

Va, Vb, Vc, ((a,b) € RA (b,c) € R) > (a,c) E€ R. 

Va, Vb, ((a,b) € RA (b,a) E R) = a=b. 

Va € X,(a,a) Z R. 

symmetric, transitive, reflexive relation on X. 

x € [alr iff rRa (iff aRe). 

S € X/R iff da € X, S = [a]r. 

for f: X >Y and u,v E€ X,u-y v iff f(u) = fw). 

For a,b € Z, a =n b iff n divides a — b. 

Z/n = {[a]=,, :a Z} = {[0]n, In,- [n 1n}, 
where [r]n = {r + kn : k € Z}. 

la]n On [bln = [a + b]n for a,b € Z. 

[a]n n [b]n = [ab]n for a,b € Z. 

a set P of nonempty subsets of X where 
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Yz € X,3S € P,x € SandVS,TEPSATSSNT=O. 


For a,b E€ X,a~pbiffdSe PaceSAbesS. 

antisymmetric, transitive, reflexive relation on X. 

pair (X, <) where < is a partial order on X. 

partial order where Yx, y E X,£ Sy Vy<r. 

zE SNVyYES,y<z. 

ZzESAVWVE S,z<y. 

partial order on X where each nonempty subset 
of X has a least element. 


Definition 
There exists a bijection f : X > Y. 
There exists an injection g : X > Y. 


|X| = n iff there exists a bijection f : {1,2,...,n} > X. 


an € Z>o,|X| =n. 

X is not finite. 

There exists a bijection f : Z>o > X. 
X is finite or X is countably infinite. 
X is not countable. 
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TABLE 7.4 
Theorems on Relations. 


Direct Images 
Empty Image 
Codomain Property 
Monotonicity 

Image of Union 
Image of Intersection 
Image of Difference 


Setup: R is a relation (or function) from X to Y. 
RØ] = 0. 

RIA] CY. 

If AC C, then R[A] C R[C]. 

Vier Aj] = Vier Rf[A;]. 
[Mier Aj] C Miecz R[A:]. 


R 
R 
RIA] — RIC] c RÍA- C]. 


Inverses and Compositions 
Inverse of Identity 

Double Inverse 

Inverse of Union 

Inverse of Intersection 
Composition with Identity 
Associativity 

Inverse of Composition 
Monotonicity Properties 


Empty Set Properties 


Setup: R is a relation from X to Y. 
Iz' = Íx. 

(R7!) = R. 

(SUT) E= ST rU T hea T EST, 
(GAT =S ta T eee ee E, 
Ro Ix = R and Iyo R=R. 
To(SoR)=(ToS)oR. 

(Res) = S7} o RH. 

X CY > Ix Cly;if RCS, then 
R'CS!,ToRCT0oS,and RoT CSoT. 
Ig = Ó and 07t = Ø and RoW = =o R. 


Distributive Laws (SUT)oR=(SoR)U(TOR). 
To(SUR)=(ToS)U(ToR). 
Partial Distributive Laws (SOT)oRC(SoR)N(ToR). 
To(SNR)C(ToS)N(ToR). 


Then fix an arbitrary xo € X, and use the definitions of f and g to prove f(x) = 
(Xo). 

6. Proving Invertibility. We can prove that a function f : X — Y is invertible by 
proving f is one-to-one and onto. Alternatively, we can prove invertibility by 
defining a proposed inverse g : Y + X, then proving fog = Idy and go f = Idx. 


7. Common Pitfalls. Remember that composition of relations (or functions) is usu- 
ally not commutative: RoS # SoR for most choices of the relations (or functions) 
R and S. Do not confuse the preimage notation f~'[B] with the notation f~t 
for an inverse function; preimages are defined for any function f, even when the 
inverse function f~! does not exist. The two-sided inverse of f is unique when it 
exists, but left inverses and right inverses are not unique in general. Two functions 
with the same domain and the same values are not equal unless their codomains 
also agree. 


Equivalence Relations, Set Partitions, and Partial Orders 


1. Equivalence Relations. A relation R on a set X is an equivalence relation iff R 
is reflexive on X, symmetric, and transitive. This means that for all a,b,c € X, 
aRa; if aRb then bRa; and if aRb and bRc then aRc. The equivalence class |a] r 
is the set of all elements of X related to a under R: [alr = {x € X : eRa}. 


2. Theorem on Equivalence Classes. Given an equivalence relation R on a set X and 
a,b € X, [alr = [b]r iff aRb iff bRa iff b € [alr iff a € [b]r. Two equivalence 
classes of R are either disjoint or equal. 
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TABLE 7.5 
Theorems on Functions. 
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Preimages Setup: f:X >Y. 

Empty Preimage FO = A. 

Domain Property f [BJ CX. 

Preimage of Codomain folly] =X; 

Monotonicity If B C D, then f~'[B] C f7"[D] 

Preimage of Union PP Whee Bd = Uer FBA. 

Preimage of Intersection fo Mier Bi] = Ne 7 f~"[Bi]. 

Preimage of Difference f'[B — D] = f7+[B] — foi [DI]. 

Image of Preimage YBCY, fl ~B] C B; equality holds if f is surjective. 
Preimage of Image YVA C X, AC f'[f[|A]]; equality holds if f is injective. 
Function Composition Setup: f: X >Y, g:Y >Z, h:Z >W. 
Associativity (hog)of=ho (go P 

Identity f oldx = f = Idy of. 


Image under Composition 
Preimage under Composition 


(g ° PIA] = gIfIAI]- 
(go f) IC] = flg [C]. 


Injections/Surjections/Bijections 
Composing Injections 

Composing Surjections 

Composing Bijections 

Backwards Rule for Injections 
Backwards Rule for Surjections 


Setup: f: X >Y andg:Y > Z. 

If f and g are injective, then go f is injective. 
If f and g are surjective, then go f is surjective. 
If f and g are bijective, then go f is bijective. 
If go f is injective, then f is injective. 

If go f is surjective, then g is surjective. 


Inverse Functions 
Two-Sided Inverse 
Bijectivity and Invertibility 


Inverse of Identity 
Double Inverse 

Inverse of Composition 
Left Inverse Criterion 
Right Inverse Criterion 


Setup: f: X >Y, g:Y >X, h: Y >Z. 
g = f~! iff fog = Idy and go f = Idx. 
f~} exists iff f is bijective; 
in this case, f~* is unique and bijective. 
Idx is invertible, and Idx! = Idx. 
If fT} exists, then (f7 1nd =f: 
If f and h are invertible, then (ho f) = f7 o h™t. 
For X #9, f has a left inverse iff f is injective. 
f has a right inverse iff f is surjective. 


Pointwise Operations 
Commutativity 
Associativity 

Identity 

Inverses 


Distributive Law 


a fgh: X OR. 
f+9=g+ fand f-g=g:- f. 
(ft+g)+th=f+(g+h) and (f- 9) 


ff 3 
f+(-f)=0=(-f) +f and 
f 


when g(x) is never zero) g- (1/g) = 1 = (1/g) - g. 


(g +h) =(f-9)+(f-h). 
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TABLE 7.6 


Theorems on Cardinality. 


Cardinal Equality: 
Reflexivity 
Symmetry 
Transitivity 

Disjoint Unions 


Products 


Power Sets 

Cardinal Ordering: 
Reflexivity 
Antisymmetry 
Transitivity 

Products 
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|A| = |A|. 
|A| = |B| = |B| = |A]. 
(a = |B| A |B| = |C) > |A| = |C]. 


If |A;| = |Aj| for all i € I and A; N Aj = 0 = A; N Aj 
for all i # j in T, then | Uez Ail] =|Uje, 4il- 

If |A;| = |A; for 1 <i<n, 

then |Ai x + x An| =|A, x +--+ x Ahl. 

If |A| = |B|, then |P(A)| = |P(B)|. 

|A] < |Al. 


(A| < |B| A|B| < |A|) > |A| = |B| (Schröder-Bernstein Theorem). 


(|A| < [B| ^ |B] <|Cl) = |A] < |C]. 
If |A;| < |A; for 1 <i<n, 
then |A x +--+ X An| <|Aj -x Aj |. 


Finite Sets: 
Uniqueness of Size 
Sum Rule 

Product Rule 

Power Set Rule 
Preserving Finiteness 


For m,n € Zo, |X| = m and |X| = n imply m =n. 

For pairwise disjoint finite X1,..., Xs, |U;_, Xe] = op, |Xel- 
For finite X1,...,Xs, [Xi x- x Xs| = Ifi |X 

For finite X, [P(X)| = 2*1. 

For finite X,X1,...,Xs, Uz_, Xk and Nj- Xk 

and Xı x--- X Xp and P(X) and all subsets of X are finite. 


Countable Sets: 
Countability Criteria 


Countably Infinite Sets 
Ordering Property 
Subset Property 
Product Property 
Union Property 


X is countably infinite iff |X| = |Zsol. 

X is countable iff |X| < |Zso| iff 3 bijection f : X > Y C Zyo. 
Z>a, L<a, Z, Q, Z*, Q? (where k € Z>o) are countably infinite. 
If |X| <|Y| and Y is countable, then X is countable. 

Any subset of a countable set is countable. 

A product of finitely many countable sets is countable. 

A union of countably many countable sets is countable. 


Uncountable Sets: 
Ordering Property 
Uncountable Sets 


Cantor’s Theorem 


If |X| < |Y| and X is uncountable, then Y is uncountable. 

R, (a,b), [a,b], and P(Zs0) are uncountable (for real a < b). 
For |X| > 1, the set of sequences f : Z>o + X is uncountable. 
For all X, |X| < |P(X)|. 
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3. Set Partitions. A set partition of X is a set of nonempty subsets of X such that 
every a € X belongs to exactly one set (block) in the set partition. 


4. Correspondence Between Equivalence Relations and Set Partitions. Given an 
equivalence relation R on a set X, the quotient set X/R = {[a]jrp : a € X} is 
a set partition of X. Given a set partition P of X, the relation ~p, defined for 
a,b E€ X by a ~p b iff a and b belong to the same block of P, is an equivalence 
relation on X. The function sending R to X/R is a bijection from the set of 
equivalence relations on X to the set of set partitions of X. The inverse function 
sends a set partition P of X to ~p. 


5. Examples of Posets. Number systems such as R are totally ordered by <. A 
collection of sets is partially ordered by C. A set of positive integers is partially 
ordered by divisibility. Other examples include subposets, product posets, and 
concatenation posets. 


6. Well-Ordering Property. The poset (Zso,<) is well-ordered, meaning that ev- 
ery nonempty subset of positive integers has a least element. This property is 
equivalent to the Induction Axiom. 


Review Problems 


1. Complete the following definitions and proof templates: 
a) “R is a relation from X to Z” means... 


b) For a relation S and set A and object w, “w € S[A]” means... 

c) For relations R and S and objects x,y, “(x,y) € So R” means... 

d) For a relation T and ordered pair (a,b), “(a,b) € T7!” means... 

e) “F is a function from X to Y” means... 

f) Given a function g with graph G and objects u,w, “w = g(u)” means... 
g) Given a set Z and objects k,m, “(k,m) € Iz” means... 

h) Given f : X > Y and A C X and an object y, “y € f[A]” means... 


i) Given f : X > Y and B CY and an object x € X, “x € f~'[B]” means... 

j) “h : A > C is one-to-one” means... 

k) “p : D > E is onto” means... 

1) Given F : Z > W and G: Z > W, outline a proof that F = G. 

m) Given relations R and S from X to Y, describe two ways to prove that 
R=S. 

n) Given f,g,h : R > R and t € R, define (f - g)(¢+1), (f o g)(t + 1), and 
g+ f)(t+1). 

o) “R is reflexive on X” means... 

) “S is symmetric” means... 

q) “T is transitive” means... 

r) “X has n elements” means... 

s) “|U| < |V|” means... 

2. Fix a,b€ R with a 40. Define f : R > R by f(x) = ax +b for alla ER. 

a) Is f one-to-one? Is f onto? Prove your answers. 

b) Repeat (a) replacing R by Q everywhere in the setup. 

c) Repeat (a) replacing R by Z everywhere in the setup. Does the answer to (c) 
depend on the particular value of a? 


© 


3. For each set X and equivalence relation R on X, explicitly describe the requested 
equivalence classes. 
(a) X = {a,b,c,d}, R = {(a,a), (b,b), (c, c), (d, d), (b,c), (c, b), (a, d), (d,a) y; find 
lala, [b]r, (ela, and [d]r. 
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10. 


11. 


12. 


13. 
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(b) X =Z, R is congruence mod 5; find [7]pr. 
(c) X= {1, 2, 3,4, 5}, R= Ix; find [3]r- 
(d) X =Rso x Rso, ((x, y), (u,v)) E€ R means z + y = u + v; draw [(1, 2)]p. 


Give a specific example of a function f : Z>o > Zso with the stated properties. 
Prove that your examples work. (a) f is one-to-one and not onto. (b) f is onto 
and not one-to-one. (c) f is not one-to-one and not onto and not constant. (d) f 
is one-to-one and onto and f # Idz,,. 


Suppose S,T,U are finite sets with |S| = a and |T| = b and U C S. Which 
formulas are always true? Explain. (a) |SUT| = a+b. (b) |S x T| = ab. (c) |U| =c 
for some integer c < a. (d) |P(S)| = 2°. (e) |S -T| = a — b. 

Which of the following sets are countably infinite? (a) Ø (b) {1, 2,3} (c) Z>o (d) Z 
e) Q (f) Z x Z (g) R (h) the set of all infinite sequences of 0s and 1s (i) P(Z). 


i) Draw an arrow diagram and a digraph for each relation. (ii) Is each rela- 
tion reflexive on X = {1,2,3,4}? symmetric? transitive? Explain. (iii) Compute 
R[{1,3}] and Ro R`! for each R. 

a) R = {(1, 1), (2, 2), (3, 3), (4, 4), (1, 3), (3, 4), (1, 4), (2, 4)}. 

) R= {, 2), (2, 1), (3, 4), (4, 3)}- 

) R=(X x X)-—Ix. 

) R = {(1, 1), (3,3), (1,3), (3, 1), (2, 2), (4, 4), (2, 4), (4, 2)}. 

) Prove: for all f,g,h: ROR, (f-g)oh=(foh)-(goh). 

) Disprove: for all f,g,h: ROR, fo(g-h)=(fog)-(foh). 

c) Give four examples of functions f : R —> R such that for all gh: R > R, 
fo(g-h) =(fog):(foh). 

Suppose R is an equivalence relation on X = {1,2,3,4,5,6} containing the or- 
dered pairs (1,4), (6,3), and (3,2). (a) Draw the digraph of R. Give the answer 
with the fewest possible arrows. (b) Find all distinct equivalence classes of the 
relation R in (a). (c) What is the set partition X/R? 

(a) Define f : Ro > R by f(x) = 2% for all real x # 0. Prove f is one-to- 
one but not onto. What is f [Ryo]? (b) Define g : R x R > R x R by g((z,y)) = 
(a+y,2—y) for all (x,y) € RxR. Is g one-to-one? Is g onto? Prove your answers. 


Suppose X and I are fixed sets, R; is a fixed relation on X for i € J, and 
R = Uj, Ri. Prove or disprove each statement. (a) R is a relation on X. (b) If 
every R; is reflexive on X, then R must be reflexive on X. (c) If every R; is 
symmetric, then R must be symmetric. (d) If every R; is transitive, then R must 
be transitive. 


Given functions f : X —> U and g : Y > V, define h : X x Y ~ U x V by 
h((x,y)) = (f(a), g(y)) for all (x,y) € X x Y. 

a) Prove: h is a function. 

b) Prove: if f is one-to-one and g is one-to-one, then h is one-to-one. 

c) Prove: if f is onto and g is onto, then h is onto. 

d) Prove: if f is a bijection and g is a bijection, then h is a bijection. 

a) Prove: for all relations R and all sets A and B, R[A] — R[B] C R[A — B]. 

b) Give an example to show that equality does not always hold in (a). 

c) Suppose R is a relation from X to Y and A, B C X. Which of the following 
conditions on R will force equality to hold in (a)? Select all that apply: R is the 
graph of a function from X to Y; R is the inverse of the graph of a function from 
X to Y; Ris the graph of an injection from X to Y; R is the graph of a surjection 
from X to Y. 
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14. Prove: for all f : X —> Y and g : Y —> Z and all C C Z, (go f)“[C] = 
f [g7 C]. 

15. Prove: for all f : X —> Y, f is injective iff (for all A,C C X, f[ANC] = 
FIA] 9 FCI). 

16. For each set X and relation R, prove that R is an equivalence relation on X. 
(a) X = Z’, (a,b,c)R(d,e, f) means a +b+c=d+e+ f for a,b,c,d,e, f € Z. 
(b) X =R, (x,y) € R means y — z € Q for x,y € R. 
(c) X = Rso x Rso, ((x,y), (a,b)) € R means there exists r € Ryo with a = rz 
and b = ry, for xz,y,a,b E R. 


17. Let X = [0,3] x [0,3]. For each equivalence relation R below, draw a picture of 
X and the equivalence classes |(1,3)]r and [(2, 2)] r- 
(a) R is defined by: ((a, b), (c, d)) € R iff a = c, for (a,b), (c,d) € X. 

(b) R is defined by: ((a, b), (c, d)) € R iff b = d, for (a,b), (c,d) € X. 

(c) R is defined by: ((a,b), (c, d)) € R iff a +b = c + d, for (a,b), (c,d) E€ X. 

(d) R is defined by: ((a, b), (c, d)) € R iff ab = cd, for (a,b), (c,d) € X. 


18. Let X and I be fixed, nonempty sets. For each iz € J, let R; be a fixed equivalence 
relation on X. (a) Prove: R = (),-, Ri is an equivalence relation on X. (b) Prove: 
for all b € X, [blk = Nierlblr:- 


19. Prove or disprove: for all sets X, Y, P, Q, if P is a set partition of X and Q is a 
set partition of Y, then PU Q is a set partition of X UY. 


20. Let R be a relation from X to Y. For each identity below, find a condition on R 
that will guarantee the truth of the identity. (Possible conditions are: R is any 
relation whatsoever; R is the graph of a function; R is the graph of an injection; R 
is the graph of a surjection; R is the graph of a bijection.) Prove that your answer 
works, but give examples showing that more general relations do not work. 

a) For all B C Y, R[R71[B]] = B. 

b) For all A C X, R7}[R[A]] = A. 

c) For all A,C C X, R[AUC] = RJA] U RJC]. 

a) Define an explicit bijection f : Z > Zso. (b) Prove: for all sets X and Y, if 

X and Y are disjoint countably infinite sets, then X U Y is countably infinite. 

c) Does (b) remain true if X and Y are not disjoint? 

22. (a) Define f : Z4) > Z>o by f(a, b,c) = 2°3ł5°7% for a,b,c,d € Z>o. Prove that 
f is injective but not surjective. (b) Use (a) and a theorem to prove that Z4, is 
countably infinite. 


23. Let X be an infinite set, and let Z be the set of all f : X — X such that f(x) # x 
for all x € X. Prove that Z is uncountable. 


24. Let Y be the set of all f : Z>o  Zso such that f(n) = n for all but finitely 
many n € Zyo. Is Y countable or uncountable? Prove your answer. 


tel 
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Real Numbers (Optional) 


8.1 Axioms for R and Properties of Addition 


The goal of this optional chapter is to present an axiomatic development of the real number 
system. Such a development is needed to put the subject of calculus (also called analysis) on 
a firm logical footing. We assume no prior knowledge of real numbers or any other number 
system (even the integers), instead deducing all needed facts from the initial axioms. We 
do assume that facts about logic, proofs, sets, relations, and functions are already known. 
Thus we may invoke any results from previous sections excluding theorems about integers 
and rational numbers from Chapter 4. These theorems will become available later after we 
formally define Z and Q. 


Undefined Terms and Initial Definitions 


Mathematical theories based on axioms were described in §2.1. Like all such theories, the 
axiomatic development of the real numbers begins with certain undefined terms. Our theory 
uses the following eight undefined terms: 


a) a set R called the set of real numbers 
b) a function ADD : R x R > R called addition 


c) a real number 0 € R called zero 


e) a function MULT: R x R > R called multiplication 
f) 


a function MINV : Ro > R called multiplicative inverse 
g # 
) 


a real number 1 € R called one 


( 
( 
( 
(d) a function AINV : R > R called additive inverse 
( 
( 
( 
( 


h) a relation LEQ on R called the ordering. 


The next definition introduces the standard arithmetical symbols for the functions and 
relations listed above. 


8.1. Definition: Arithmetical Operators. 
(a) For all x,y € R, define the sum of x and y to be £ +y = ADD(z, y). 
(b) For all x € R, define the negative of x to be —x = AINV(z). 


(c) For all x,y € R, define the difference of x and y to be x — y = x + (—y). The function 
sending (x,y) to x — y is called subtraction. 


(d) For all x,y € R, define the product of x and y to be x- y = MULT(z,y). We often 
abbreviate x+y as xy. 


(e) For all x £0 in R, define the inverse of x to be 2~+ = MINV(z). 
(£) For all z,y € R with y 4 0, define x divided by y to be x/y = z- (y+). We also write 7 
for x/y. The function sending (x,y) to x/y is called division. 
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g) For all x,y € R, define x < y iff (x,y) € LEQ. 
h) For all x,y € R, define x > y iff y < x. 


( 
( 
(i) For all z,y € R, define x < y iff (x < y and z Æ y). 
( 


) 

j) 

k) A chain of inequalities such as x < y < z < w abbreviates “x < y and y < z and z < w.” 
y y 


For all x,y € R, define x > y iff y < a. 


Note that addition and the other arithmetical operations are assumed to be functions 
with codomain R. This means, for example, that given any two real numbers x and y, there 
exists a unique real number x +y. The fact that the output x + y must belong to the set R 
is called closure of R under addition. Thus, as part of the axiomatic setup, we are assuming 
that R is closed under addition, additive inverses, multiplication, and multiplicative inverses 
of nonzero real numbers. It follows that R is closed under subtraction and division by nonzero 
real numbers. Also, the special constants 0 and 1 belong to R. Starting from 1, we can define 
some new real numbers as follows. 


8.2. Definition of Some Specific Numbers. Define 2 = 1 +1,3 =2+1,4=3+1, 
5=4+1,6=5+1,7=6+1,8=7+1,9= 8+1, and 10 = 9+ 1. By closure, the symbols 
2 through 10 are real numbers, called two, three, four, five, siz, seven, eight, nine, and ten 
(respectively). 


Axioms 


We list all the axioms for R in this subsection. We divide the axioms into four groups: axioms 
for addition, axioms for multiplication, basic ordering axioms, and the completeness axiom. 
The axioms in each group, and properties derived from them, are discussed in detail later. 
For emphasis, we list certain closure properties as axioms, even though these properties are 
already implicit in the initial descriptions of the undefined terms. We begin with the five 
axioms for addition. 


8.3. Axioms for Addition. 

Al. Closure: For all x,y € R, x+y €R and 0 € R and —z €E R. 
A2. Associativity: For all x,y,z € R, (£ +y)+z=x£+ (y +2). 
A3. Additive Identity: For all x € R, z +0 =x =0+ rz. 

A4. Additive Inverses: For all x € R, x + (~x) = 0 = (—2) +2. 


A5. Commutativity: For all x,y E R, £ +y =y+7z2. 


There are seven axioms for multiplication. The first five are exact analogues of the five 
addition axioms, and the last two stipulate relationships between the operations of addition 
and multiplication. 


8.4. Axioms for Multiplication. 

M1. Closure: For all x,y E€ R, x-y € R and 1 € R and if z £ 0, then x7! € R. 

M2. Associativity: For all x,y,z E€ R, (x -y)-z =x- (y- z). 

M3. Multiplicative Identity: For all x € R,x-l=ax=1-z2. 

M4. Multiplicative Inverses: For all x € R, if z Æ 0, then x (£71) = 1 = (£7!) - z. 

M5. Commutativity: For all x,y E R, £ -y =y- x. 

D. Distributive Laws: For all x,y,z € R, x: (y+2z) = (x-y)+(x-z) and (y+z)-x = (y-x)+(z-2). 
N. Nontriviality: 0 £ 1. 
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There are seven ordering axioms. The first four state that (R, <) is a totally ordered set; 
the next two relate this ordering to the algebraic operations of addition and multiplication. 
8.5. Basic Ordering Axioms. 

O1. Reflexivity: For alla € R, x < x. 
O2. Antisymmetry: For all x,y € R, if x < y and y < a, then x = y. 
O3. Transitivity: For all x,y,z € R, if x < y and y < z, then x < z. 
O4. Total Ordering: For all x,y ER, £x <y ory< rz. 
O5. Additive Property: For all x,y,z € R, if x < y, then r+z <y +z. 
O6. Multiplicative Property: For all x,y,z € R, if x < y and 0 < z, then z-z < y- z. 
The final ordering axiom states a crucial technical property of R called completeness. 


To even formulate the axiom, we need the following concepts, which make sense for any 
partially ordered set. 


8.6. Definition: Upper Bounds. For all S C R and zo € R, | zo is an upper bound of S 
iff Vy € S, y < z.| For all S C R, | S is bounded above iff | az ER Vy€ S,y < Z|. 


8.7. Definition: Least Upper Bounds. For all S C R and z € R, 
zo is a least upper bound of S$ | (written | zo = lub S )) iff 


(Vy € S,y < zo) A (Yz ER, if Vy € S,y < x] then zo < x). 


8.8. Completeness Axiom O7. For all S C R, if S is nonempty and S is bounded above 
then S has a least upper bound in R. 


Although not explicitly listed here, we are also assuming various logical axioms governing 
the logical symbols such as >, V, 4, and =. Regarding equality, we are assuming that = 
is reflexive, symmetric, and transitive, and satisfies various substitution properties such as: 
for all x,y,z € R, if y = z, then xz + y = x + z. These properties will be used frequently 
without specifically invoking the underlying logical axioms. 


8.9. Remark: Algebraic Structures. In abstract algebra, we study many algebraic sys- 
tems that satisfy various subsets of the axioms listed above (with the set R replaced through- 
out by an appropriate set of objects). In particular, a system satisfying axioms A1 through 
A5 is called a commutative group; omitting A5 gives the axioms defining a group. A system 
satisfying all the additive and multiplicative axioms is called a field; omitting M4, N, and 
multiplicative inverses gives the axioms for a commutative ring; and further omitting axiom 
M5 gives the axioms for a ring (with identity). If we include all the axioms for R except 
completeness (O7), we get the definition of an ordered field; an ordered commutative ring 
is defined similarly. Here are a few examples using number systems not officially defined 
yet: Z (the set of integers) is an ordered commutative ring that is not a field; Q (the set of 
rational numbers) is an ordered field that does not satisfy the completeness axiom; C (the 
set of complex numbers) is a field that cannot be made into an ordered field; and the set of 
2 x 2 matrices with real entries is a non-commutative ring. 
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Properties of Addition 


We begin our development of the properties of R by deriving some consequences of the 
addition axioms Al through A5. It is helpful to keep track of which axioms are needed 
to derive various results, since the same results also hold in other more general algebraic 
systems for which these particular axioms are true. In particular, we give proofs avoiding 
the commutativity axiom A5 when possible. Our first theorem is a basic but very useful 
uniqueness result. 


8.10. Theorem: Left Cancellation Law for Addition. 
For all x,y,z € R, if x +y = < + z, then y = z. 


Proof. Fix arbitrary x,y,z € R. Assume « + y = x + z; prove y = z. By the substitution 
property of equality, we can add —z to both sides of the assumption z + y = £ +z, 
obtaining (—x) + (x + y) = (—x) + (x + z). (This step also uses closure (A1), since we 
need to know —zx € R, x+y € R, and x + z € R in order for the sums (—x) + (x + y) and 
(—x)+(a+z) to be defined. We make constant use of substitution axioms and closure axioms 
when manipulating complicated expressions, but we seldom mention these axioms in the 
sequel.) By associativity (A2), the previous equation becomes ((—x£)+£)+y = ((—x)+x)+z. 
By the inverse axiom (A4), we get 0+y = 0+ z. Finally, the identity axiom (A3) turns this 
equation into y = z, which is the required conclusion. This proof used axioms A1, A2, A3, 
and A4. 


There is an analogous right cancellation law: for all x,y,z € R, if y +x = z + x, then 
y = z. This law follows from the left cancellation law via commutativity (A5), but it is 
better to adapt the proof given above (add —x to both sides on the right) so that the proof 
only uses axioms Al through A4. The next theorem lists some fundamental facts that can 
be deduced as special cases of the left cancellation law. Similar facts hold based on the right 
cancellation law, but we do not state these explicitly. 
8.11. Theorem on Addition. For all z,y,a,b € R: 
(a) Uniqueness of Additive Identity: If x + y = x, then y = 0. 
(b) Uniqueness of Additive Inverses: If x + y = 0, then y = —2. 
(c) Negative of Zero: —0 = 0. 
(d) Double Negative Rule: For all a € R, —(—a) =a. 
(e) Negative of a Sum: For all a,b E R, —(a + b) = (—b) + (—a). 


Proof. For (a), fix x,y € R; assume x + y = x; prove y = 0. Using the identity axiom (A3), 
our assumption becomes x + y = x + 0. Now use left cancellation, taking z = 0, to deduce 
y = 0. [This result says that zero is the only real number having the additive identity 
property stated in A3.] 

For (b), fix x,y € R; assume z +y = 0; prove y = —«. Using the inverse axiom (A4), our 
assumption becomes x +y = x + (—2x). Now use left cancellation, taking z = —x, to deduce 
y = —«. [This result says that for each fixed real x, —x is the only real number having the 
additive inverse property stated in A4.] 

Now we prove (c). On one hand, taking x = 0 in the inverse axiom (A4) gives 0+ (—0) = 
0. On the other hand, taking x = —0 in the identity axiom (A3) gives 0+ (—0) = —0. Thus, 
—0 = 0 + (—0) = 0. 

For (d), fix a € R. By the inverse axiom (A4), (—a) +a = 0. Using part (b) with x = —a 
and y = a, we deduce y = —2, i.e., a = —(—a). 
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For (e), fix a,b € R. We compute 


(a +b) + ([-8] + [-a]) = a + (b + ([-8] + [-a])) = a + ((b + [-8]) + [-a]) 
=a + (0+ [-a]) = a + [-a] = 0. 


These equalities follow by associativity (A2), associativity again, the inverse axiom (A4), 

the identity axiom (A3), and the inverse axiom (A4). Now use part (b) with x = a +b and 

y = |-b] + [-a] to deduce y = —2, i.e., [~b] + [-a] = — (a + b). 
The above proofs use axioms A1 through A4 only. 


8.12. Remark. If we allow ourselves to use commutativity (A5), we can rewrite part (e) 
of the last theorem as —(a + b) = (—a) + (—b). We might attempt to prove this identity 
using the distributive axiom (D), as follows: 


—(a +b) =(-1)- (a+b) = (—1) -a + (—1) -b = (~a) + (—b). 


However, this calculation requires the property —a = (—1)-a, which has not yet been proved. 
Even worse, this proof refers to multiplication to prove a property involving addition alone. 
Thus the proof would not work in an algebraic structure that had an addition operation 
but no multiplication operation. 


8.13. Theorem on Subtraction. For all x,y,z € R: 


(a)a—-O=2a. 

b) 0— x = -zr 

c)r-r= 

d) -(z —- y) =y - x = (—£) + y. 


(x—y) + y -2)=r- 2. 
(x+2)-(yt2)=0-y. 


Proof. We prove one identity and leave the others as exercises. Fix x,y € R. We compute 


(z =y) =—-(@@+[-yl) =-(-y) + (=r) = y + (-#) = 9-2, 


using the definition of subtraction, then 8.11(e), then 8.11(d), then the definition of sub- 
traction. Using commutativity (A5), we can also write y — x = y+ (—x) = (—x) + y. 


The next result illustrates how specific arithmetical identities can be proved from the 
axioms. 


8.14. Proposition. 2+ 4 = 6 and 3 — 5 = —2. 


Proof. We gradually build up to the identity 2+4 = 6 by proving simpler related facts. First, 
2+2 = 2+ (1+1) = (2+1)+1 = 3+1 = 4 using the definition of 2, then associativity (A2), 
then the definition of 3, then the definition of 4. Second, 2+3 = 2+ (2+1) = (2+ 2)+1 = 
4+1 = 5 using the definition of 3, then associativity (A2), then the first result, then the 
definition of 5. Third, 2 + 4 = 2 + (3 + 1) = (2+ 3) + 1 = 5 + 1 = 6 using the definition of 
4, then associativity (A2), then the second result, then the definition of 6. 

Similarly, to prove 3—5 = —2, observe that 2+ (3—5) = 2+(3+(—5)) = (2+3)+(—5) = 
5 + (—5) = 0 by definition of subtraction, then associativity (A2), then the second result 
above, then inverses (A4). Now use 8.11(b) with z = 2 and y = 3 — 5 to see that y = =z, 
ie., 3—5 = =2. 


Any other arithmetical fact involving addition or subtraction of specific integers can be 
proved in a similar manner. 
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8.15. Remark: Generalized Associativity. Given three real numbers a, b, c, the expres- 
sion a + b + c is ambiguous: it could mean (a + b) +c or a+ (b + c). However, associativity 
(A2) assures us that the latter two expressions must be equal, so it is safe to write a+b+c 
with no parentheses. Similarly, given four real numbers a,b,c,d, the ambiguous expression 
a+b+c+d could represent any of the following five quantities: 


(a+ (b+c)) +d, ((a+b)+c)4+d, (a+b)+(c+d), a+ (b+ (c+d)), a+ (b+c) +d). 


Each expression on this list can be seen to be equal to the next expression by an appropriate 
special case of associativity (A2). For example, letting x = a+b, y = c, and z = din 
A2 shows that the second expression equals the third. Because of this fact, we often write 
a+6+c+d with no parentheses, and we can group adjacent terms together at will if needed 
in a particular calculation (as in the proof of Theorem 8.11(e)). A similar result holds for a 
sum of n terms: given a1, @2,...,@, E R, any possible parenthesization of a; + a2 +---+an 
produces the same final output. This fact is called generalized associativity, and its proof is 
far from obvious, as the n = 4 case already suggests. Since we have not yet defined what 
a positive integer n is, we cannot even state this general fact formally, much less prove it. 
Luckily, in our initial development, we only need this fact for a few specific small values of 
n (like n = 4), where it can be justified by explicit repeated uses of A2. 

There is a related fact called generalized commutativity based on A5. It states that we 
can rearrange the order of terms in a; + a2 +- + an in any fashion without changing the 
final result. For example, a+b+c+d=d+b+a+c=b+c+d+a, and so on. Henceforth, 
we use generalized associativity and commutativity (for small values of n) without specific 
comment. 


Section Summary 


1. Undefined Terms. Our axiomatic development of the real number system uses 
these undefined concepts: the set R, the operations £ +y, —x, x: y, and z~1, the 
numbers 0 and 1, and the relation x < y. (Here, x,y,z E€ R and z £0.) 


2. Axioms. Addition and multiplication satisfy the axioms of closure, associativity, 
identity, inverses, and commutativity. The distributive laws hold, and 0 4 1. The 
relation < is reflexive, antisymmetric, transitive, a total order, and is preserved by 
addition and multiplication by nonnegative numbers. Finally, the completeness 
axiom tells us that every nonempty subset of R that is bounded above has a least 
upper bound in R. 


3. Basic Definitions. We define xz — y = £ + (~y), x/z =x- z271, x < y iff x < y and 
xz Æ#y, x> yiffy< zx, and z > y iffy < ax (for x,y,z E€ R with z 4 0). Define 
2=1+1,3= 2+1, and so on. 

4. Properties of Addition. The additive cancellation law holds: for all x,y,z € R, 
xz +y =z +z => y= z. Zero is the unique additive identity; —x is the unique 
additive inverse of x € R; —0 = 0; —(—x) = x; and —(x + y) = (~y) + (—2) for 
all x,y € R. Addition satisfies generalized associativity and generalized commu- 
tativity. 


Exercises 


1. (a) Give a detailed proof of the right cancellation law without using axiom A5. 
(b) State and prove analogues of 8.11(a) and 8.11(b) based on the right cancella- 
tion law. 
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2. Complete the proof of Theorem 8.13. 
3. Deduce 8.11(c) from 8.11(b). 


4. Prove that 5+ 5 = 10 and 2—6 = —4 and 6 — 3 = 3 (you may need to prove 
some other related identities first). 


5. (a) Using only the addition axioms Al through A5, prove: Va, b,c,d € R, (a+c)+ 
(b+ d) = (a+b) + (c + d). (b) Using only axioms and theorems proved above, 
prove: Va,b,c,d € R, (a + c) — (b + d) = (a — b) + (c — d). 

6. Suppose we give meanings to the undefined symbols of our theory by letting 
R = {a} (where a is some fixed object), a +a = a, —a = a, a -a = a, 0 = a, 
1 = a, and LEQ = {(a,a)}. (Multiplicative inverses are not defined since nonzero 
elements of R do not exist.) Prove that this system satisfies all the axioms for the 
real numbers except the nontriviality axiom N. 


7. Let R? =R x R x R. Define addition, zero, and negatives on this set as follows: 
for all a,b,c, x,y,z € R, 


(a,b, c) + (x,y,z) = (@+2,b+y,e+ 2), 0= (0,0, 0), —(a,b, c) T (—a, —b, =c). 


Use known properties of R to show that R? satisfies the additive axioms Al 
through A5. 


8. Suppose we redefine the multiplicative operations in R by setting x-y = 0, 1 = 0, 
and z~! = 0 for all z,y,z € R with z 4 0. Which of axioms M1 through M5, D, 
and N are now true? 


9. Suppose we redefine addition on R by setting x +0 = x = 0 + z for all x € R, 
and y + z = 0 for all nonzero y,z € R. (a) Show that axioms A3, A4, and A5 are 
true, but the associativity axiom A2 is false. (b) Show that additive inverses are 
not unique in this system. 

10. Suppose we redefine addition on R by setting x + y = x for all x,y € R. Which 
of the addition axioms A1 through A5 are true? 

11. For x,y € R, define £ xy = x +y +2. Show that the addition axioms Al through 
A5 hold if we replace + by x, 0 by —2, and —x by —a — 4. 

12. Let X be a fixed set. Use theorems about bijections to prove that the set G 
consisting of all bijections f : X + X is a group if we replace addition (in axioms 
A1 through A4) by composition of functions, zero by Idx, and additive inverses 
by inverse functions. 


13. If possible, give an example of a field in which 0 = 3. 


14. Prove: for all a,b,c € R, (a +b) +c = (b+a)+c= (c+b)+a = (b+c)+a= 
(a+c)+b=(c+a) +b. Justify every step using axioms A2 and A5 only. 

15. (a) Define real numbers 11, 12, 13, 14, 15, 16, 17, and 18. (b) Make an addition 
table with rows and columns labeled 0,1,2,...,9, such that the entry in row a 
and column b is a + b. (c) Suppose you wrote a complete proof of your answer 
to (b). In such a proof, how many times would a definition, axiom, or previous 
result be invoked? Explain. 


16. List all complete parenthesizations of the expression a + b + c + d + e, where 
a,b,c,d,e € R. Use the associativity axiom A2 to prove that all these expressions 
are equal. 
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8.2 Algebraic Properties of Real Numbers 


We continue to develop properties of the arithmetical operations — addition, subtraction, 
multiplication, and division — based on the algebraic axioms Al through A5, M1 through 
M5, D, and N. The theorems proved using these axioms are valid in any field (such as the 
rational numbers, the complex numbers, or the integers modulo a prime). 


Multiplicative Properties of Zero 


Before developing general properties of multiplication, we need to examine the relationship 
between zero (the additive identity) and the multiplication operation. Our first result is 
very familiar, but the proof is surprisingly tricky. 


8.16. Theorem on Multiplication by Zero. 
Z1. For alla € R,0-a=Oanda-0=0. 


Proof. Fix a € R. [How do we even begin? This is the hard part — the trick is to think of 
examining the quantity 0-a+0-a.] We compute (0-a)+(0-a) = (0+0)-a =0-a using the 
distributive axiom (D) and the additive identity axiom (A3). Now recall Theorem 8.11(a): 
Ve,y€R,e+y=x2>y=0. Taking x = y = 0 -a here, we have verified that x+y = x and 
so we may conclude that y = 0, i.e., 0-a = 0. We prove a- 0 = 0 similarly, starting from 
the quantity a-0+a-0. This proof used addition axioms Al through A4, M1 (where’?), 
and D. [Since D and N are the only axioms connecting multiplication to addition, we can 
retroactively motivate the introduction of 0-a+0-a by noting that we needed some expression 
that would allow us to use the distributive law.] 


8.17. Theorem on Closure Properties of Nonzero Real Numbers. 
Z2. For all x,y € R, if x A0 andy £ 0, then z- y 40. 

Z3. No Zero Divisors: For all x,y € R, if x- y = 0, then z = 0 or y = 0. 
ZA. For all x,y € R, if x - y #0, then x £0 and y £0. 

Z5. For all x € R, if x £0, then z7! £0. 

Z6. For all x € R, if x # 0, then —x £0. 


Proof. Z2. [We try proof by contradiction.] Assume, to get a contradiction, that there exist 
x,y € R such that x 4 0 and y 40 and z- y = 0. We know 7! exists because x # 0; so 
we may multiply both sides of x - y = 0 on the left by xt, obtaining x~!- (x-y) = x- 0. 
Using associativity (M2) and the previous result (Z1), this becomes (a~!-a)-y = 0. By the 
inverse axiom (M4), we get 1-y = 0. By the identity axiom (M3), we deduce y = 0. But 
this contradicts the assumption y Æ 0. So property Z2 holds. 

Z3. The IF-statement here is the contrapositive of the IF-statement in Z2, so Z3 is 
logically equivalent to Z2. 

Z4. Fix x,y € R. Using a contrapositive proof, we assume x = 0 or y = 0, and prove 
x-y = 0. In the case where x = 0, x-y =0-y = 0 follows from Z1. Similarly, in the case 
where y = 0, x : y= 2-0=0 follows from Z1. 

Z5. Assume, to get a contradiction, that there exists x € R with x 4 0 (so a~* exists) 
and x! = 0. Multiply both sides of x7! = 0 by x to get x- x7! = x - 0. By the inverse 
axiom (M4) and Z1, we get 1 = 0. This contradicts the nontriviality axiom N, which states 
140. 

Z6. We use a contrapositive proof. Fix x € R; assume —xz = 0; prove x = 0. By 
Theorem 8.11, x = —(—x) = —0 = 0. 


1 
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In abstract algebra, an integral domain is a commutative ring where 1 4 0 and there 
are no zero divisors. In other words, an integral domain satisfies all the additive and multi- 
plicative axioms omitting M4 and multiplicative inverses, and including Z3 as an additional 
axiom. Later, we define Z (the set of integers) and prove that Z is an integral domain. 
Our proof of Z3 from the field axioms shows that every field is an integral domain, but the 
converse is false (Z provides a counterexample). 


Sign Rules; More Arithmetic 


When first learning about negative numbers, many people are surprised to learn that the 
product of two negative numbers is positive. This fact follows from the sign rules, which 
relate the additive inverse operation to multiplication. 


8.18. Theorem on Signs. For all a,b, x,y,z € R: 

(a) First Sign Rule: a- (—b) = —(a - b). 

b) Second Sign Rule: (—a) -b = —(a- b). 

c) Third Sign Rule: (—a)-(—b) =a- b. 

d) Inverses and Multiplication by —1: —x = (—1) - x. 

e) Distributive Laws for Subtraction: x-(y—z) = (x-y)— (x-z) and (y—2z)-x = (y-x)—(z-x). 
Proof. Fix a,b € R. For (a), we compute (a - b) + (a - (—b)) = a- (b+ (—b)) =a-0=0 
using the distributive axiom (D), then additive inverses (A4), then property Z1. Using 
Theorem 8.11(b) with z = a-b and y = a- (—b), we conclude y = —zx, i.e., a- (—b) = — (a-b). 
The second sign rule is proved similarly. Using both of these sign rules and the double 
negative rule, we compute 


(—a) - (=b) = —[(—a) : b] = -[- (a : b)] = a - b. 
To prove (d), fix x € R. By the sign rule and the identity axiom M3, (—1)-x = —(1-x) = —a. 
To prove (e), fix x,y,z € R. We compute 
x- (y =z) =x: (y + (=2)) =(@-y) + (x - (—2)) = (x: y) + =(2 - z) = (x - y) — (x: 2), 


by the definition of subtraction, the distributive law for addition, the sign rule, and the 
definition of subtraction. The other distributive law is proved similarly. 


fee E a ee 


Using the distributive law, the sign rules, and theorems about addition, we can now 
prove more arithmetical facts about specific real numbers. For example, let us show that 
2x 4 = 8. (Here, x x y is defined to mean zx- y.) Assume that we have already proved 
2+2 = 4, 4+ 2 = 6, and 6 + 2 = 8, using the technique illustrated in Proposition 8.14. 
First, 

2x 2= 2x (1+1)=(2x1)+(2x1)=2+2=4, 
using the definition of 2, then the distributive axiom (D), then the identity axiom (M3), 
then the known result for addition. Second, 


2x 3=2x (2+1)=(2x2)+(2x1)=4+2=6, 


using the definition of 3, then the distributive axiom (D), then previously proved results. 
Third, 
2x 4=2x (3+1)=(2x3)+(2x1)=6+2=8, 


using the definition of 4, then the distributive axiom (D), then previously proved results. 
Now, the sign rules allow us to conclude 


2x (—4) = -8 = (2) x4;  (=2) x (—4) = 8. 


Here is one more illustration of the power of the distributive law. 
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8.19. Theorem: FOIL Rule. For all t, x,y,z € R, (t + x): (y + z) = ty + tz + £y + az. 


Proof. Fix t,x,y,z € R. Set a = y + z € R. Using the distributive law three times, we find 
that (t+x)- (y +2) = (t+ zx) -a = ta + za = t(y +z) + z(y +2) = (ty + tz) + (£y + zz). 


Using this rule, we can deduce algebraic identities such as (2 + y)? = x? + 2xy + y? and 
(x + y)(x — y) = x? — y’; here 2? is defined to be z- z, for any real z. 


Properties of Multiplication and Inverses 


We now derive properties of multiplication and multiplicative inverses that are based on 
the left and right cancellation laws. These properties and their proofs are analogous to the 
corresponding properties for addition, except we need to ensure that we never take the 
multiplicative inverse of zero. 


8.20. Theorem on Multiplication. For all a,b,c, x,y € R: 


(a) Cancellation Laws for Multiplication: If a Æ 0 and a-b=a-c, then b = c. 
If a #0 and b-a = c-a, then b = c. 


b) Uniqueness of Multiplicative Identity: If x 40 and z: y = z, then y = 1. 


c) Uniqueness of Multiplicative Inverses: If x-y = 1, then x #0 and y £ 0 and y = x7t. 


( 

( 

(d) Inverse of Identity: 17! = 1. 

(e) Double Inverse Rule: If a #0, then a~! £0 and (a~!)-! =a. 
(f) 


f) Inverse of a Product: If a #0 and b £0, then a- b £0 and 
(a-6)-* = (b=) - (a). 


(g) Inverse of a Negative: If a #0, then —a £ 0 and (—a)~! = —(a7). 


Proof. We prove (c), (d), and (g) as examples (assuming (a) has already been proved). For 
(c), fix x,y € R; assume x- y = 1. Since 1 4 0 by axiom N, we deduce z 4 0 and y 40 
by Z4. Now we can use the inverse axiom (M4) to write x -y = x- a+. Using part (a) to 
cancel the nonzero real number x, we get y = x~!. For (d), use axiom N to see that 17! 
exists; then use M3 and M4 to compute 17} = 1-17! = 1. For (g), fix a € R with a Æ 0. 
We proved —a 4 0 in Z6, so a~! and (—a)~! both exist. Now (—a) -(—[a~4]) =a-at=1 
using the sign rules and M4. Invoking part (c) with x = —a and y = —[a~'], we conclude 
y =a", that is, —[a~+] = (—a)7?. 


As in the case of addition, using commutativity (M5) we can write (x-y)~'=a7!-y71 


when x,y are nonzero real numbers. We can also prove generalized associativity and gener- 
alized commutativity of multiplication by repeated use of M2 and M5. This justifies writing 
expressions such as abcd without any parentheses. We can omit even more parentheses by 
agreeing that multiplication takes precedence over addition, so that ab+cd means (ab)+ (cd) 
rather than a(b + c)d. 


8.21. Theorem on Division. For all x,y,z € R, if y #0 and z £0, then: 
(a) x/1 = zx. 


b) 1/y = y7} 
y/y =1. 
) y/z #0 


c) 
d 
e) (y/2)7} = z/y =y} z. 
f) 
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Proof. We prove part (g) as an example. Fix x,y,z € R with y and z nonzero. By Z2, 
y- z #0, so the division (xz)/(yz) is defined. We compute 
Lz = gts SSA, a 2 
TE = (02) (y2)? = (E2) (67T) = alle) = aly ay =E, 
using the definition of division, the rule for the inverse of a product, generalized associativity, 
the inverse axiom, the identity axiom, and the definition of division. 


Fraction Rules 
We now derive some algebraic rules for executing arithmetic operations on fractions. 
8.22. Theorem on Fractions. For all x,y € R and all nonzero t,u,w € R, 


(a) Addition Rules: Tpl as ee pee BA 
u 


U u u w uw 


(b) Negation Rules: 


(c) Subtraction Rules: 


x 
u 
T Y T-Y L y zrw-—uy 
F : 


(d) Multiplication Rules: x - z = -= y: os 


=1 
t . 
one ake 


Proof. We prove the addition rules, a multiplication rule, and a division rule as examples. 
Fix x,y € R and nonzero t,u,w E R. First, 


(u/u) + (y/u) =£- u? +y: u7! = (x+y) u™' = (x + y)/u, 


using the definition of division, then the distributive axiom (D), then the definition of 
division. Second, recall the previous result = = Z€ and 4 = #2, Combining this with what 
we just proved, we see that 


£ Yy TW: p uy AU uy 


u w uw ww uw ’ 


as needed. Third, compute 


xy x = = z 
~ = (zy)(uw)™* = (xy) (w7 tut) = (wu! )(yw*) = =- 
uw u w 
using the definition of division, the rule for the inverse of a product, generalized associativity 
and commutativity, and the definition of division. Finally, using the rule just proved, note 
that 
t w tw tw 
w t w:t tew 
By Theorem 8.20(c), we deduce (t/w)~! = w/t. 


(tw) (tw)7! = 1. 


Section Summary 


1. Properties of Zero. A product of real numbers is zero iff one of the factors is 
zero. The set of nonzero real numbers is closed under multiplication, inverses, 
and negatives. 
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2. 
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More Algebraic Properties. The sign rules state a(—b) = (—a)b = —(ab) and 
(—a)(—b) = ab for all a,b € R. Nonzero factors can be cancelled: if a Æ 0 and 
ab = ac, then b = c. The number 1 is the unique multiplicative identity. For a 4 0, 
aT! is the unique multiplicative inverse of a. When the inverses are defined, we 
have (a~!)~' =a, (ab)~! = b-1a7}, and (—a)~! = —(a~+). The distributive law 


generalizes to give identities such as (t + 2)(y+z) = ty + tz + zy + az. 


Fraction Rules. We can perform arithmetic on fractions via rules such as 


(a/u) + (y/w) = (xw + uy)/(uw), (x/u): (y/w) = (wy)/(uw), and (t/w)~* = w/t 
(assuming all denominators are nonzero). 


Exercises 


I: 


SOs GO ESL SOP AN 


10. 


11. 


(a) Prove Va € R,a- 0 = 0 by simplifying (a- 0) + (a- a). (b) Prove the sign rule 
Va,b € R, —(a- b) = (—a)- b. 


Finish the proof of Theorem 8.20. 


Use Z3 (no zero divisors) to prove the cancellation property in Theorem 8.20(a) 
without using multiplicative inverses. (It follows that this cancellation property 
holds in all integral domains.) 


Prove: (a) 3 x 3 = 9. (b) 5 x (—2) = —10. (c) (—2) x (-3) = 6. 

Prove: (a) 1/2 + 2/3 = 7/6. (b) 1/3 — 3/5 = —4/15. (c) (3/8) - (2/3) = 1/4. 
Finish the proof of Theorem 8.21. 

Prove Theorem 8.22 parts (b) and (c). 

Finish the proof of Theorem 8.22 parts (d) and (e). 


Use the distributive law to prove the following identities for all v, w,z,y,z E€ R. 
(a) (z +y}? = x? + Qry +42. 
( 


( 

(c) 

(d) w(x +y + 2z) = wz + wy + wz. 
(e) (v + w)(x +y + z) = vz + wz + vy + wy + vz + wz. 

(£) (x +y)? = x? + 3x?y + 3xy? + y? (where u? means u-u-u, for u € R). 

For x,y € R — {—1}, define zxy = x-y +x +y. (a) Show: Yz, y € R — {1}, 
xxy € R— {-1}. (b) Prove that axioms M2, M3, and M5 hold with - replaced 
by x and 1 replaced by 0. (c) Define an operation x’ (analogous to multiplicative 
inversion) and prove that for all x € R—{-1}, z’ € R—{—1} and xx2’ = 0 = a’ xx. 


Define the set of matrices M2(R) to be the set R*, where we display (a, b, c, d) € R* 


as the 2 x 2 array k 


. Define algebraic operations on matrices as follows: 


d 
for a,b,c,d,w,x,y,z E€ R, let 
a b| |w a) _ | aru b+e GRU | See —b 
c d y z| | ety d+z l’ c d| | —c -d |’ 
_ | 0 0 1 |10 
Om R) = 070l? MR =| g 4 |? 


a b| |w x]_ | awt+by ar+bz 
c d y z| | cwtdy catdz |’ 
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T2. 


c d ad—bc| —c a 
Use known algebraic properties of R to prove that Mə(R) satisfies all the alge- 
braic axioms except M5 and the requirement that every nonzero matrix have a 
multiplicative inverse. 
Which of the properties Z1 through Z6 hold for matrices in M2(R)? Explain. 
(Assume z7! exists when considering Z5.) 


-1 
k A = : | s a] when ad — bc 4 0. 
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8.3 Natural Numbers, Integers, and Rational Numbers 


We are all informally acquainted with the natural numbers N = Zso = {1,2,3,...}, the 
integers Z = {...,—2,—1,0,1,2,...}, and the rational numbers Q (ratios of integers). It is 
possible to give a rigorous construction of these number systems using set theory, defining 
first N, then Z, then Q, and finally R, and proving that each number system satisfies the 
appropriate algebraic and ordering properties. The details of this construction are very 
tedious and intricate, however, and we have chosen instead to begin with an axiom system 
for the final number system R on our list, which requires no prior knowledge of the more 
basic number systems. 

However, we frequently need integers and rational numbers to prove theorems about 
real numbers and calculus. It turns out we can define N, Z, and Q as certain subsets of 
the set R, and then derive the basic properties of these number systems fairly quickly from 
the axioms for R. In particular, the Induction Axiom for N (discussed in §4.1) is provable 
from our definition of N given below. Aided by induction, we show that N, Z, and Q satisfy 
appropriate closure properties among those listed in axioms Al and M1. The remaining 
algebraic and order properties of these number systems then follow automatically from the 
corresponding properties of R. 


Inductive Sets; Defining Natural Numbers 


Intuitively, the set of natural numbers! is N = {1,2,3,...}. The trouble with this intuitive 
description is that it does not precisely define what is meant by the ... at the end. Given 
that real numbers and addition are already available to us, we can approach this problem 
via closure properties in R. Restating the intuitive description, N should be the set of all 
real numbers that we can reach by starting with 1 and adding 1 repeatedly. This motivates 
the following definition. 


8.23. Definition: Inductive Sets. Given S C R, 
S is inductive | iff |1 € S and Vz € R, x E€ S > (x +1)e S|. 


Thus, an inductive set must be closed under 1 (meaning 1 is in the set) and closed under 
adding 1. For example, R itself is an inductive set due to the closure axioms Al and M1. 
Later, when we discuss properties of order, we will see that R>o and Rs, are also inductive 
sets. Informally, Z and Q (not yet officially defined) are inductive sets. All of these inductive 
sets are too big to be N because they contain extra elements not reachable from 1 by adding 
1. The following clever device lets us precisely define the idea that N should be the smallest 
possible inductive subset of R. 


8.24. Definition of the Natural Numbers N. Let Z be the set of all inductive subsets 
of R. Define | N = Msez 5 | 


Thus, N is the intersection of all inductive subsets of R. Since R itself is inductive, Z 
is nonempty, and the generalized intersection is defined. Expanding the definition, we see 
that for all objects x, |x € N | iff | for every inductive set SCR, x € S|. 


8.25. Theorem. N is an inductive set. 


lIn the next few sections, we use the notation N, rather than Z>o, to avoid assuming in advance that 
natural numbers are the same as positive integers. This fact is proved as a theorem later. 
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Proof. For every inductive set S, 1 € S. So 1 € N. Next we show N is closed under adding 
1. Fix x € N and prove xz + 1 € N. For any inductive set S, we know x € S,soxr+1eES9 
since S is inductive. Since this holds for all inductive sets S, we see that x +1 € N. 


We now prove two versions of the Induction Axiom for N. The second version (involving 
open sentences) is the one discussed earlier in §4.1. 


8.26. Theorem: Set-Theoretic Induction Principle. Suppose S is a set satisfying these 
conditions: (a) S C N; (b) 1 € S; (c) for all x € S, x +1 € S. Then S =N. 


Proof. Assume S' satisfies the three conditions. By condition (a), it suffices to prove that 
N C S. By conditions (b) and (c), S is an inductive set. Thus N, which is the intersection 
of all inductive sets, must be a subset of S. 


8.27. Theorem: Induction Principle. Suppose P(n) is an open sentence such that P(1) 
is true, and for all n € N, if P(n) is true, then P(n + 1) is true. Then for all n € N, P(n) is 
true. 


Proof. Given the open sentence P(n), form the set S = {n € N : P(n) is true}. By definition 
of S and the conditions on P(n), S satisfies the three conditions in the Set-Theoretic 
Induction Principle. For example, to check condition (c), fix x € S. We know x € N and 
P(x) is true, so we deduce x + 1 € N and P(x + 1) is true, hence x + 1 € S. Now the 
Set-Theoretic Induction Principle tells us that S = N, which means that P(n) is true for 
all n EN. 


Closure Properties of N 


Since N is an inductive set, we know that N contains 1 and is closed under adding 1. Using 
induction, we can now prove the stronger statements that N is closed under addition and 
multiplication. 


8.28. Theorem on Closure of N under Addition. For all m,nEN,m+nEeN. 


Proof. Let mo be a fixed element of N. For n € N, let P(n) be the statement “mo +n € N.” 
We use induction on n to prove P(n) is true for every n € N. To see that P(1) is true, 
note that mo + 1 € N because mo € N and N is closed under adding 1. Next, fix no € N, 
assume P(no), and prove P(no + 1). We have assumed mo + no € N, and we must prove 
mo + (no + 1) € N. Using associativity in R (A2), we know mo + (no + 1) = (mo + no) + 1. 
Since mo +no € N and N is closed under adding 1, (mo +no)+1 € N. Thus mo+(no+1) € N, 
as needed. 


8.29. Theorem on Closure of N under Multiplication. For all m,n EN, m-neéEN. 


Proof. Let mo be a fixed element of N. We prove Vn € N, mo-n € N by induction on n. 
When n = 1, the identity axiom (M3) gives mo -1 = mo € N. Now fix no € N, assume 
mo:no € N, and prove mo: (no+1) € N. By the distributive law in R and (M3), mo-(no+1) = 
Mo: no +Mo: 1 = Mo: no + mo. By induction hypothesis, mo no € N. By closure of N under 
addition, Mo - no + Mo € N. So mo - (no + 1) € N, as needed. 


On the other hand, N is not closed under additive inverses, subtraction, multiplicative 
inverses, or division. To prove these claims, we need order properties to be discussed later. 
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Defining Integers; Closure Properties of Z 


Informally, we obtain the set of integers from the set of natural numbers (positive integers) 
by adjoining zero and the negatives of all the positive integers. These negatives are known 
to exist in R, by the closure axiom A1 for R. Here is the official definition. 


8.30. Definition of Integers. For all x € R, |x € Z| if x €N or x =0 or =z EN. | Ele- 
ments of Z are called integers. 


Now we show that Z is closed under all the arithmetical operations except multiplicative 
inversion and division. 


8.31. Theorem on Closure of Z under Negation. For all y € Z, —y E€ Z. 


Proof. Fix y € Z; prove —y € Z. We know y E N or y = 0 or —y €E N, so consider three 
cases. 

Case 1. Assume y € N. Then —(—y) = y € N by the double negative rule, so —y € Z 
because —y satisfies the third alternative in the definition of Z (taking x there to be —y). 
Case 2. Assume y = 0. Then —y = —0 = 0, so —y € Z by the second alternative in the 
definition of Z. 

Case 3. Assume —y € N. Then —y € Z by the first alternative in the definition of Z (taking 
x there to be —y). 


8.32. Theorem on Closure of Z under Multiplication. For all x,y E€ Z, £- y € Z. 


Proof. Fix x,y € Z; prove x -y € Z. We know x € N or z = 0 or —a EN, and similarly for 
y, giving nine cases. Five of these cases can be handled by the observation that if x = 0 or 
y = 0, then x-y =0 € Z. Now we examine the remaining four cases. 

Case 1. Assume x € N and y € N. We already proved z -y E€ N, so z- y €Z. 

Case 2. Assume x € N and —y € N. By the sign rules and closure of N under multiplication, 
we deduce — (x - y) = x - (—y) € N. So x - y € Z by the third alternative in the definition of 
Z: 

Case 3. Assume —x € N and y € N. As in Case 2, we get —(x - y) = (—x) -y € N, hence 
z-y EZ. 

Case 4. Assume —x € N and —y EN. The sign rules and closure of N give x-y = (—x)-(—y) € 
N, soxz-y Ez. 


Proving closure of Z under addition turns out to be trickier; we begin with a lemma. 


8.33. Lemma. For all x € Z, xzx—1€Z. 


Proof. Fix x € Z; prove x — 1 = x + (—1) € Z. We know x € N or x = 0 or —2 EN, so 

consider three cases. 

Case 1. Assume x € N. We prove x — 1 € Z by induction on a. If x = 1, then 1—1 =0 €Z. 

For the induction step, fix x € N, assume x — 1 € Z, and prove (x +1) — 1 € Z. Since 

(xz+1)—1=xz+(1—1)=x+0 = rz EN, we can conclude that (x +1) — 1 € Z without 

even using the assumption zx — 1 € Z. 

Case 2. Assume x = 0; prove x — 1 € Z. Here, x— 1 = 0—1 = —1 € Z since —(-1) =1 € N. 

Case 3. Assume —x € N; prove x — 1 € Z. Write —x = n € N. Since N is closed under 

adding 1, we know n + 1 € N. Now g — 1 = —(—2) + (—1) = (—n) + (—1) = —(n + 1), so 
(x—1)=n+1 €N, sox- 1 €Z by the third alternative in the definition of Z. 


8.34. Theorem on Closure of Z under Addition. For all x,y € Z, x +y € Z. 
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Proof. Fix x,y € Z. We know x € N or g = 0 or —2 EN, and similarly for y. If x = 0, then 
e+y=O0+y=yeZ. Ify = 0, then x+y = x+0 = z € Z. So we can reduce to the 
following four cases. 

Case 1. Assume x,y € N. We already proved x+y E€ N, so xr +y EZ. 

Case 2. Assume —x,—y € N. Write n = —x and m = —y; since n,m € N, we know 
m+n EN. Then —(x+y) = [+y] +[-2] = m+n €N, so x+y € Z by the third alternative 
in the definition of Z. 

Case 3. Assume x € N and —y € N. We prove that x+y € Z by induction on n = —y. Since 
y = —n, our goal is to show that x—n € Z for each natural number n. If n = 1, then x—1 € Z 
by the lemma. For the induction step, fix n € N, assume x—n € Z, and prove x—(n+1) € Z. 
We know x- (n+1) = r+ (—(n+1)) = r+ ((—n)+(—1)) = (x+ (—n))+(-1) = (x—-n)—1. 
By hypothesis, x — n € Z. By the lemma, (x — n) — 1 € Z, completing the induction step 
for Case 3. 

Case 4. Assume —x € N and y € N. We can prove (—x) + y € Z as in Case 3, or we can 
reduce this case to Case 3 by noting (~x) + y = y + (~x). 


It follows that Z is closed under subtraction, since x — y = x + (—y). Since Z C R 
by construction, we can also immediately conclude that the set Z satisfies all the axioms 
and theorems discussed so far that do not mention multiplicative inverses, division, or 
least upper bounds. For instance, since x + y = y + holds for all real numbers x and 
y by A5, this equation must also hold for all x and y in the subset Z of R. As another 
example, properties Z2 and Z3 (absence of zero divisors) hold for Z even though their 
proofs used the concept of multiplicative inverses in R. In summary, we have proved that 
Z is an ordered integral domain. | All the properties of integers proved in Chapter 4 (such as 


theorems on divisibility, primes, greatest common divisors, and unique prime factorizations) 
are now Officially available for use in our formal development of R. 


Defining Rational Numbers 


Now that we have formally constructed Z within R, the definition of rational numbers given 
in §2.1 is valid. Recall that for all x € R, we defined 


LEQ) & JAmeZ Ane ZnZ40Ar=m/n'\. 


Using the fraction rules, we can prove the following closure properties of Q. 


8.35. Theorem on Closure Properties of Q. For all z,y € Q, x+y € Q, =z € Q, 
£ — y € Q, and z- y € Q; if y £0, then y7} € Q and z/y € Q. Moreover, Z C Q, so 0€ Q 
and 1 € Q. 


Proof. We prove closure under subtraction as an example. Fix x,y € Q; write z = a/b and 
y = c/d for some a,b,c,d € Z with b, d Æ 0. By the fraction rules, x — y = (a/b) — (c/d) = 
(ad — bc)/(bd). By closure properties of Z proved earlier, we know ad, bc, ad — bc, and bd 
are in Z with bd Æ 0 (by Z2). So x — y meets the criterion for membership in Q, taking 
m = ad — bc and n = bd in the definition above. 

To prove that Z C Q, note that 1 € Z (since 1 € N) and 1 Æ 0 by axiom N. So for any 
m € Z, writing m = m/1 shows that m € Q. 


Having proved all the closure properties, we conclude that |Q is an ordered field |; in 
other words, the set Q with the restricted arithmetical operations and order relation satisfies 
all the axioms for R (and their consequences) except possibly completeness. In particular, 
the unique prime factorization theorem for Q (see §4.7) is now available. One consequence 
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of that theorem was the fact that for any positive integer n that has an odd exponent 
in its prime factorization, the equation x? = n has no solution x in Q. It can be proved 
from the Completeness Axiom (O7) that this equation does have a solution x in R; see 
Theorem 8.61 below. These facts combine to show that the number system Q does not 
satisfy the Completeness Axiom. This axiom thus reveals a crucial difference between the 
rational number system and the real number system. Indeed, many of the central theorems 
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of calculus would not work if we used Q instead of R. 


Section Summary 


I; 


Natural Numbers. An inductive set is a subset of R containing 1 and closed under 
adding 1. The set N is the intersection of all inductive sets. N itself is an inductive 
set closed under addition and multiplication. To prove Vn € N, P(n), we may 
prove that P(1) is true and for all n € N, if P(n) is true, then P(n + 1) is true. 


Integers. The set Z consists of all real numbers x such that x € N or x = 0 or 
—a EN. Z is closed under addition, negation, and multiplication. Z is an ordered 
integral domain. 


Rational Numbers. The set Q consists of all real numbers x such that there exist 
m,n € Z with n 4 0 and z = m/n. Q contains Z and is closed under addition, 
negation, multiplication, and inverses of nonzero elements. Q is an ordered field 
that does not satisfy the Completeness Axiom. 


Exercises 


10. 


Finish the proof of Theorem 8.35. 
Prove or disprove: the union of any nonempty collection of inductive subsets of 
R is an inductive set. 

Decide (with proof) whether each set below is inductive. [You may assume that 
1 > 0 has already been proved.] (a) Z>o (b) Z>ı (c) Z>ı (d) Ø (e) the set of 
odd integers (f) {k/3:k €Z} (g) {1,2,3,4,5,6,7,8,9} (your proof should use 
1 > 0 somewhere). 

(a) Show that R — Z is closed under adding 1. (b) Is R—Z an inductive set? (c) Is 
R — Z closed under addition? negation? multiplication? multiplicative inverses? 
When answering (c), assume we have proved that 27t € R — Z. 

Repeat the previous exercise, replacing R — Z by R—Q and replacing 27! by V2. 
Show that Z is the intersection of all subsets of R containing 0 and 1 and closed 
under addition and negation. 


Show that Z is the intersection of all subsets of R containing 1 and closed under 
subtraction. 


Show that Z is the intersection of all inductive subsets closed under subtracting 
1. Use this to formulate an induction principle for proving statements of the form 
Yn € Z, P(n). 

Show that Q is the intersection of all subsets of R containing 0 and 1 and closed 
under addition, subtraction, multiplication, and division by nonzero elements. 


Double Induction. Suppose S is a set such that: (a) S C N x N; (b) (1,1) € S; 
(c) v(x, y) E€ S, (x +1,y) E€ SA (x,y +1) € S. Prove that S =N x N. 
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11. State an induction principle for proving statements of the form 
Ym E N, Yn € N, P(m,n). Prove this principle using the previous exercise. 


12. We know from the Division Theorem that there is a function MOD : Z x Zs9 > Z 
given by MOD(a,b) = a mod b = the unique remainder when a is divided by b. 
For n € Zs, define the set of integers modulo n to be 


Zn = {0,1,2,...,.n-l}={@eEZ:0<a <n}. 


Define algebraic operations and constants on Zp as follows: for x,y € Zn, let 
Tn y = (x +y) mod n, Ong = (—x) mod n, £ Qn y = (x - y) mod n, On = 0, 
and 1, = 1. Prove that Z, with these operations is a nonzero commutative ring 
(i.e., satisfies all algebraic axioms excluding M4). It can help to first prove this 
lemma: for all a,b € Zn, a = b iff n divides a — b. (An alternative construction of 
Zn, based on the equivalence relation =,, is given in 86.6.) 


13. Continuing the previous exercise, suppose p is a prime. (a) Use gcds to prove: for 
all nonzero k € Z,, there exists a unique m € Zp with k 8p Mm = 1 = m &p k. 
Define the multiplicative inverse of k in Zp to be this unique m. (b) Show that 
Zp is a field. (c) Show that if n is not prime, then Z, does not satisfy Z2 and 
therefore cannot be a field. 
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8.4 Ordering, Absolute Value, and Distance 


Now that we understand the basic algebraic properties of R, we begin to study the conse- 
quences of the order axioms O1 through O6. We develop fundamental laws of inequalities 
as well as various facts about the absolute value function. This leads to a discussion of the 
concept of distance on the real line, which is defined in terms of absolute value. 


Ordering Properties 


The basic ordering axioms were stated in §8.1. Axioms O1 through O4 postulate that R is 
totally ordered by the relation <. In detail, for all x,y,z € R, x < x (reflexivity, axiom O1); 
if x < y and y < a, then x = y (antisymmetry, axiom O2); if x < y and y < z, then x < z 
(transitivity, axiom 03); and x < y or y < x (total ordering, axiom 04). We also defined 
x > y to mean y < x, x < y to mean x < y and x Æ y, and x > y to mean y < z. 

Our first result derives the properties of the strict order < that correspond to the prop- 
erties of < listed above. This result applies to any totally ordered set (X, <), not just 
X =R. 


8.36. Theorem on Order Properties of <, >, and >. For all z,y,z € R: 

(a) Irreflexivity: x < x is false. 

(b) Transitivity: If x < y and y < z, then x < z. 
(c)a<yiffa<yorr=y,ande>yiffe>yorr=y. 

(d) Trichotomy: Exactly one of these three statements is true: £ < y; £ = Y; y < T. 
(e) Parts (a), (b), and (d) hold with > in place of <. 

(f) Axioms O1 through O4 hold with > in place of <. 

( 


g) If x < yand y < z, then z < z. If x < y and y < z, then g < z. 
Similar statements hold for > and >. 


Proof. Fix x,y,z € R. To prove (a), note that x < x is defined to mean “r < x and z Æ x.” 
But x = z is true, so this AND-statement is false. 

To prove (b), assume x < y and y < z; prove x < z. We have assumed x < y anda Æ y 
and y < z and y Æ z; we must prove x < z and x Æ z. On one hand, using x < y and y < z 
and transitivity (03), we get x < z. On the other hand, if we had x = z, then z < y and 
y < z = and antisymmetry (O2) would give x = y, which contradicts our assumption. 
Thus, x Æ z. 

To prove (d) assuming (c) has been proved, use total ordering (O4) to see that x < y 
or y < x is true; i.e., “£ < y or x = y or y < x or y = x” is true. Thus at least one of the 
three statements in (d) must hold. If x < y and z = y were both true, we would deduce 
x < x, contradicting (a). Similarly x = y and y < x cannot both be true. Finally, if z < y 
and y < x were both true, (b) would give x < x, contradicting (a). Thus exactly one of the 
given alternatives holds. 

We leave the remaining assertions as exercises in manipulating the definitions. 


The next result uses trichotomy to deduce rules for forming useful denials of inequalities. 
8.37. Theorem on Denial Rules for Inequalities. For all x,y € R: 
(a) ~r <y ey<rer>y. 
(b) (a<yey<reury. 
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(c) ~(a>y)Syr>rerc<y. 
(d)“(a@>y)eyrzueu<y. 


Proof. We prove (a) as an example. Fix x,y € R. First assume x < y is false. By The- 
orem 8.36(c), “z < y or x = y” is false, so trichotomy forces y < x to be true. This is 
equivalent to x > y by definition. Conversely, assume y < x is true. By trichotomy, x < y 
is false and x = y is false, so “a < y or x = y” is false. By Theorem 8.36(c), we see that 
x < y is false. 


Algebraic Properties of Order 


Next we investigate the consequences of axioms O5 and O6, which relate the algebraic 
operations of addition and multiplication to the ordering relation on R. We recall that the 
additive axiom (O5) states 


Vr,y,z ER, <y> xr+z<y+z, 
and the multiplicative axiom (O6) states 
Vo,y,zER (a@<yA0<z)Sau-z2<y-z. 


8.38. Theorem on Algebraic Properties of Order. For all w, x,y,z € R: 
a) Order Reversal: x < y iff —y < —a. 
b) Adding Inequalities: If x < y and w < z, then r +w < y +z. 


d) Squares are Nonnegative: 0 < w?. 


e) Parts (a), (b), (c), axiom O5, and axiom O6 hold with < replaced by < throughout. 


( 

( 

(c) Multiplying by a Negative Number: If x < y and z < 0, then y-z<a-z. 

( 

( 

In place of (d), we have: Vw € R, w 40 => 0 < w°. Similar properties hold for > and >. 


Proof. Fix w,x,y,z € R. To prove (a), first assume z < y. Taking z = —x + (—y) in 
(O5), the assumption yields x + (—2 + (—y)) < y + (~x + (—y)). This becomes —y < —x 
after simplification via known algebra rules. Conversely, assume —y < —ax. Adding x + y 
to both sides (using O5) leads to x < y. Alternatively, the converse can be deduced from 
the direction already proved: from —y < —x we get —(—x) < —(—y), so that x < y by the 
double negative rule. 

To prove (b), assume x < y and w < z. Adding w to both sides of x < y, we get 
xz+w < y+w. Adding y to both sides of w < z, we get y +w < y + z. Now transitivity 
gives r +w <y+z. 

To prove (c), assume x < y and z < 0. By (a), 0 = —0 < —z. So we may apply (06) 
with z there replaced by —z, obtaining x- (—z) < y- (—z). This becomes —(xz) < — (yz) by 
the sign rules, so yz < xz follows from (a). 

Next we prove the version of (O6) for <, namely: if x < y and 0 < z, then zz < yz. 
Assume x < y and 0 < z. Then x < y and 0 < z, so the original O6 gives xz < yz. To 
conclude zz < yz, we show that xz = yz cannot occur. Note that 0 < z forces z 4 0, by 
trichotomy. So if xz = yz were true, we could cancel z to get x = y, but this contradicts 
the assumption x < y. So xz < yz does hold. 

Next we prove the version of (d) for <. Assuming w 4 0, we must have 0 < w or w < 0 
by trichotomy. 

Case 1. Assume 0 < w. The version of (O6) just proved (taking x = 0, y = w, and z = w) 
gives 0- w < w- w, so 0 < w?. 
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Case 2. Assume w < 0, so 0 = —0 < —w by (a). Now take z = 0, y = —w and z = —w in 
the version of (06) for < to conclude 0- (~w) < (—w) - (~w), so 0 < w? by the sign rules. 
Thus, 0 < w? holds in all cases. 

The remaining assertions are left as exercises. 


Note that 1? = 1-1 = 1 by the identity axiom (M3). Also 1 4 0 by axiom N. Thus, 
as a corollary to the strict version of (d) above, we get |0 < 1). This fact has surprisingly 
deep consequences, as we will see shortly. As one application, we can repeatedly add 1 to 
the inequality 0 < 1 and use Theorem 8.38(a) to obtain 


—10 < -< —4 < -3 < —2 < -1 <0 <1 <2<3<4<.--- <10. 


Positive and Negative Numbers 


Now we can define positive and negative real numbers and study how these concepts relate 
to inequalities. 


8.39. Definition. For all x € R,| x is (strictly) positive |iff| x > 0 |; |x is (strictly) negative 


;| x is weakly negative iff | z<o l We write Ryo 


iff} ¢ < 0f} |x is weakly positive iff | r>0 
for the set of positive real numbers; R<o, R>o, R<o, Zso, etc., are defined similarly. 


Using the denial rules for inequalities, one sees that x is weakly positive iff x is nonneg- 
ative (not negative), and x is weakly negative iff x is nonpositive. For us, the unqualified 
term “positive” means “strictly positive.” We warn the reader that some texts adopt the 
opposite convention: for them, “positive” means “weakly positive.” 

Here are some basic properties of positive numbers. You can formulate and prove anal- 
ogous properties of negative numbers, nonnegative numbers, and nonpositive numbers. 


8.40. Theorem on Positive Numbers. 


(a) For all z,y € Ryo, £ +y € Ryo, zy € Ryo, and a1, y7! € Ryo. 
Also, 2 < y iff y7! < 27}. 


(b) For all a € R, a is positive iff —a is negative; and a is negative iff —a is positive. 


(c) For all a,b € R, ab € Ryo iff a and b are both positive or both negative. 


Proof. We prove (c) and part of (a) as examples. For (a), fix x,y E€ Rso. By trichotomy 
applied to x, x # 0, so 2~! exists. By trichotomy applied to x71, we know 2! < 0 or 
x '=0Oor 2! > 0. In the case x~! < 0, we can multiply both sides of this inequality by 
the positive quantity x, obtaining x -7t < x-0, or 1 < 0. This contradicts trichotomy 
applied to 1, since we proved 0 < 1 above. So x7! < 0 is impossible. Similarly, in the case 
x ' = 0, multiplying both sides by x leads to 1 = 0, which violates axiom N. Thus, the 
third case x71 > 0 must hold. Similarly, y7! > 0. Now assume x < y. Multiply both sides 
by the positive quantity £7! to get 1 < x~ly. Then multiply both sides by the positive 
quantity y7! to get y~! < a~!. Conversely, assume y~! < x71. The result just proved and 
the double inverse rule lead to x < y. 

We prove (c) using proof by cases, assuming (b) has been proved. Fix a,b € R. If a and 
b are positive, then so is ab by part (a). If a and b are negative, then —a and —0 are positive 
by (b), so ab = (—a)(—b) is positive by (a) and the sign rules. If a or b is zero, then ab is 
zero, so ab is not positive. If a is positive and b is negative, then —b is positive by (b), so 
—ab = a(—b) is positive by the sign rules, so ab is negative by (b), so ab is not positive by 
trichotomy. The same conclusion holds if a is negative and b is positive. By trichotomy, all 
possible cases have now been considered, so (c) holds. 
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We can now deduce that N (the set of natural numbers, as defined in §8.3) coincides 
with Zso (the set of positive integers). Another useful technical fact contained in the next 
lemma is that there can be no integer strictly between two consecutive integers k and k +1. 


8.41. Lemma on N and Zo. 

(a) For alln EN, 1 <n. 

(b) For allm €Z—N,m<0. 

(c) N=ZORso = Zso. 

(d) For all n,k € Z, k < n < k +1 is false. 

(e) For all m,n E Z, m <n+1ifm<norm=n+1. 


Proof. We prove (a) by induction on n. For n = 1, 1 < 1 is true by reflexivity (O1). Fix 
n € N, assume 1 < n, and prove 1 < n + 1. We know 0 < 1; adding this inequality to the 
assumption 1 < n gives 1+0 <n+1,o 1<n+l1. 

To prove (b), fix m € Z — N. By definition of Z, we know m € N or m = 0 or -m E N. 
The first case cannot occur since we assumed m ¢ N. In the second case, m = 0, and 0 < 0 
by reflexivity (O1). In the third case, 0 < 1 < —m by part (a), so 0 < —m and hence m < 0 
by order reversal. Thus m < 0 in this case as well. 

To prove (c), fix n € N. Then n € Z by definition, and n > 1 > 0, so n € Ryo. Thus, 
N C ZAR,o. On the other hand, fix k € ZA Ryo. If k were not in N, we would have k < 0 
by (b). But we assumed k > 0 also, so this would contradict trichotomy. Thus, k € N, so 
ZORso C N. The equality ZARso = Zso follows from the definition Z>o = {a € Z : x > 0}. 

We prove (d) by contradiction. Assume, to get a contradiction, that there exist n, k € Z 
with k < n < k+1. Adding —k to these inequalities, we get 0 < n—k < 1, where m = n— k 
is an integer by closure of Z under subtraction. Since m > 0, we must have m € N by (c). 
But then m > 1 by (a), and now m < 1 < m contradicts trichotomy. 

To prove (e), fix m,n € Z. First assume m < n + 1, and prove m < n or m =n + 1. To 
do so, further assume m < n is false, and prove m = n+1. We have assumed n < m < n1. 
Since n < m < n + 1 is impossible by (d), we conclude that m = n + 1. For the converse, 
assume m < n or m = n + 1, and prove m < n + 1. In the case where m < n, note that 
n < n+ 1 (as we see by adding n to 0 < 1), som < n + 1 by transitivity. In the case 
m=n+1, m <n +1 holds by reflexivity (01). 


Absolute Value 


We define a function ABS : R > R, denoted ABs(x) = |x| for x € R, using definition by 
cases. 


8.42. Definition: Absolute Value. For x € R, let |z| = x if 0 < x, and let |x| = —a if 
x <0. 


By total ordering (axiom O4), every real x satisfies at least one of the two cases in the 
definition. Moreover, the two cases overlap consistently: if 0 < x and x < 0 both hold, then 
x = 0 by antisymmetry (02), and z = —x = 0, so |0| = 0. Thus, we do have a well-defined 
(single-valued) absolute value function. The next theorem gives the basic properties of this 
function. 

8.43. Theorem on Absolute Value. For all z,y,r € R: 


(a) 0 < |z|, and |z| = 0 iff z = 0. 


( 
(c) |æ -y| = |x| - tyl. 
(d) |æ] < y iff —y < x < y (similarly for <). 
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(e) |z — y| <r iffy—r<a<y+r (similarly for <). 

(£) —|2| < x < |e]. 

Proof. We prove (b), (d) and (f), leaving the other parts as exercises. For (b), fix x € R. 


We know 0 < x or x < 0, so consider two cases. 
Case 1. Assume 0 < 2, so —x < 0. Using the definition of absolute value twice, we see that 


|x| = x and |—a| = —(—a) = a, so |x| = |—2]. 
Case 2. Assume xz < 0, so 0 < —a. This time, the definition of absolute value gives |x| = —« 
and |—2| = —2, so |z| = |—z]. 


For (d), fix x,y € R. First, assume |x| < y, and prove —y < x and x < y. By (a), 0 < |z], 
so 0 < y by transitivity, and —y < 0. In the case where 0 < x, we know |z| = x, so x < y 
by assumption, and moreover —y < 0 < x gives —y < x by transitivity. In the case where 
x <0, we know |x| = —x, so the assumption becomes —x < y, or —y < x. Alsox <0 <y 
gives x < y by transitivity. Thus, —y < x < y in both cases. 

Conversely, assume —y < x < y, and prove |z| < y. In the case where 0 < x, we know 
|z| = x, so we must prove x < y. We have just assumed this. In the case where z < 0, we 
know |z| = —2, so we must prove —a < y. We get this from the assumption —y < x by 
order reversal and the double negative rule. 

Finally, (f) follows from (d) by taking y = |x| in (d). Since |æ| < |z| is known by 
reflexivity (O1), the equivalence in (d) lets us deduce that -|x| < x < |z]. 


Part (a) of the next theorem contains a key property of absolute value called the Tri- 
angle Inequality. The other parts of this theorem give variations of this inequality that are 
frequently used in analysis. 

8.44. Theorem on Triangle Inequality and Variations. For all a,b, x,y,z € R: 

(a) |a +b] < la] + |b. 

(b) |z — yl < [a] + [yl 
(c) |æ +y +2] < |x| + [yl + Iz]. 
(d) Ja — yl > |x| — [yl 
( 
( 
( 


e) |x + y| > |æ] — Iyl- 
f) |æ = z| < |z = y| + |y = zl. 


g) lle] —|yll < |e = yl. 


Proof. We prove (a), (e), and (f), leaving the other parts as exercises. For (a), fix a,b € R. 
Part (d) of the previous theorem (with « = a+b and y = |a| + |b|) tells us that the Triangle 
Inequality |a + b| < |a| + |b| is logically equivalent to — (|a| + |b|) < a+b < |a| + |b|. Using 
part (f) of the previous theorem with x = a and then x = b, we know that —ļa| < a < Jal 
and —|b| < b < |b|. Adding these inequalities and using algebra, we get — (|a| + |b|) < a+b < 
ja| + |b|, as needed. 

To prove (e), fix x,y € R. The trick is to notice that « = (x +y) +(—y), so that part (a) 
(with a = x + y and b = —y) gives |x| < |x +y|+ |-y| = |x +y| + |y]. Adding —|y| to both 
sides and rearranging gives |x + y| > |z| — |y|, as needed. 

To prove (f), fix x,y,z € R. Take a = x — y and b = y — z in part (a), noting that 
a+b= rz- z, to see that |x — z| < |£ — y| + ly — 2]. 


Distance 


We can use absolute value to define the distance between any two real numbers, as follows. 
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8.45. Definition: Distance in R. For all x,y € R, define the distance from x to y to be 
d(x,y) = |x — yl. 
The distance function satisfies the following properties. 
8.46. Theorem on Distance. For all x,y,z € R, 
(a) Positivity: d(x,y) > 0, and d(x,y) = 0 if z = y. 
(b) Symmetry: d(x,y) = d(y, x). 
(c) Triangle Inequality: d(x,z) < d(x,y) + d(y, z). 


Proof. These properties all follow readily from corresponding properties of absolute value. 
For example, given x,y € R, d(x,y) = |z — y| = |- (æ - y)| = |y — 2| = d(y, 2). 


In analysis, a set X together with a function d : X x X — R satisfying the three 
conditions in the last theorem (for all x, y, z € X) is called a metric space. The real number 
system R provides one of the most basic examples of a metric space. More generally, one 
can show that R” (with the Euclidean distance function d(Z,9) = Y}; (ai — yi)?) is a 
metric space, although the proof of the Triangle Inequality is harder. 


Section Summary 


1. Ordering Properties. The relation < on R satisfies irreflexivity, transitivity, and 
trichotomy; for x,y € R, x < y iff x < y and z Æ y; x < y if x < yor x = y; 
~(x < y) iff x > y; x is positive iff —x is negative; R>o is closed under addition, 
multiplication, and multiplicative inversion; 0 < x < y iff 0 < y7! < 27}; and so 
on. 


2. Properties Relating Order and Algebra. Inequalities are preserved when we add a 
constant or multiply by a positive constant; they are reversed when we multiply 
by a negative constant. Nonzero squares are strictly positive, so 0 < 1. 


3. Ordering of Integers. Every n € N satisfies n > 1, whereas every m € Z — N 
satisfies m < 0. So N = Zo; there are no integers k,n satisfying k < n < k + 1; 
and for all m,n E€ Z, m<n+1iffm<norm=n+1. 


4. Absolute Value. For all real x, if x > 0, then |æ| = x; if x < 0, then |x| = —x. For 
x,y,r € R, 0 < |z|, æ] =|—a], |ryl = |z|- |y], and |z- y| < r if y=r < z < y+r. 
The Triangle Inequality says that |a + b| < |a| + |b| for all a,b € R. 


5. Distance. For x,y € R, the distance from x to y is d(x,y) = |æ — y|. Distance 
satisfies positivity, symmetry, and the Triangle Inequality. 


Exercises 


1. Finish the proof of Theorem 8.36. 


2. Finish the proof of Corollary 8.37. 

3. Finish the proof of Theorem 8.38. 

4. (a) Finish the proof of Theorem 8.40. (b) State and prove analogous properties 
for negative numbers, nonnegative numbers, and nonpositive numbers. 

5. Finish the proof of Theorem 8.43. 


Finish the proofs of Theorem 8.44 and Theorem 8.46. 
Disprove: for all a,b € R, |a — b| < |a| — |b]. 
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8. For all m,n € Z, prove: (a) m < n+1 iff m < n; (b) m > n-1 iff m > n or 
m=n-—1;(c)m>n-—1 iff m > n. (d) Prove these results do not always hold 
when m,n E€ R. 


9. (a) Assume < is a relation on R satisfying transitivity and trichotomy (properties 
(b) and (d) in Theorem 8.36). For x,y € R, define x < y to mean x < y or £ = y. 
Prove that < satisfies axioms O1, O2, 03, and O4. 

(b) Assume < satisfies axioms O5 and O6. Prove that < as defined in part (a) 
satisfies axioms O5 and O6. 


10. In some developments of R, the set Ryo of positive real numbers is taken as an 
undefined concept. Axioms O1 through O6 are replaced by these axioms: (P1) For 
all x,y E€ R>0, £x +y € Rso. (P2) For all x,y € R>0o,£ -y € R>o. (P3) For all 
x € R, exactly one of these statements is true: x € Ryo; « = 0; —x € Ryo. For 
x,y E€ R, x < y is defined to mean y — x € Ryo, and x < y is defined to mean 
x < yor x = y. Show that axioms O1 through O6 follow from these definitions 
and axioms P1, P2, P3. [Use the previous problem.] 


11. Define the set of complex numbers C to be the product set R x R. Define algebraic 
operations on complex numbers as follows: for all x,y, s,t € R, let 


(x,y) + (s,t) = (x +5,y +t), —(x,y) = (~x, —y), Oc = (0,0), 
Ic = (1,0), (x,y) + (s, t) = (xs — yt, xt + ys), 
-1 T TY 
(z,y) = = (o =) for (x, y) # Oc. 
Use known algebraic properties of R to show that C satisfies the field axioms (Al 


through A5, M1 through M5, D, and N). It follows that all algebraic rules proved 
for R from these axioms also hold in C. 

12. (a) Show that the equation z? = —1 has exactly two solutions z € C. (b) Hence 
show that there does not exist an ordering relation < on the field C satisfying 
axioms O1 through O6. 

13. Show that for p prime, there does not exist on ordering relation < on the field 
Zp satisfying axioms O1 through O6. (See the exercises of §8.3 for the definition 
of Z, and the algebraic operations on this set.) 


14. Prove that Z is not a field. 
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8.5 Greatest Elements, Least Upper Bounds, and Completeness 


Every nonempty finite subset of R has a greatest element and a least element; this can be 
proved by induction from the total ordering axiom (04). However, an infinite subset of R 
may or may not have maximum and minimum elements. This leads to the concept of a least 
upper bound for a set of real numbers, which generalizes the idea of the maximum element 
of a set. The Completeness Axiom (O7) guarantees that least upper bounds exist whenever 
possible. This axiom has many striking consequences about the structure of the real number 
system, and it is absolutely fundamental for proving the major theorems of calculus. 

After discussing the basic definitions, we use completeness to prove some familiar but 
technically subtle facts about how the number systems Z and Q appear within R. In par- 
ticular, we show that given any real number there is a larger integer (the Archimedean 
Property); any half-open interval [a,2 + 1) of length 1 contains exactly one integer; and 
every open interval (a, b) contains a rational number. We also prove some strong versions of 
completeness that apply to sets of integers, such as the Well-Ordering Principle and the fact 
that if a nonempty set of integers is bounded above, then that set has a greatest element 
(not just a least upper bound). Finally, we use completeness to prove the Nested Interval 
Property of real numbers and the fact that every positive real number has a real square 
root. 


Maximum and Minimum Elements 


We recall the definitions of the maximum element and minimum element of a set; these 
definitions make sense in any partially ordered set. 


8.47. Definition: Maximum and Minimum Elements. Fix S C R and z E€ R. 


(a) | zo = max S | iff | zo E€ S and Va € S, x£ < zo. 


(b) 
(c) 
( 


d) | S has a least element | if | 3z € S,Yx E€ S, z < x. 


S has a greatest element | iff | 3z € S,YVx E€ S, £ < z. 


zo = min S | iff | zo € S and Va E S, 20 <a. 


We stress that max S and min § need not exist; see the examples below. But if S has a 
greatest element, that element is unique. For suppose y,z € R are both greatest elements of 
S. Since y is a greatest element and z € S, we deduce z < y. Since z is a greatest element 
and y € S, we deduce y < z. By antisymmetry (axiom O2), y = z. Similarly, min S' is 
unique when it exists. 


8.48. Example: Intervals. Fix a < b in R. The closed interval S = [a,b] = {x ER: 
a<a <b} has greatest element b = maxS and least element a = min S. On the other 
hand, we claim the open interval T = (a,b) = {x € R: a < x < b} has no greatest element 
and no least element. To prove that T has no maximum, we negate the definition and prove 
Vz €T,da € T,z < x. Fix z E€ T, so a < z < b. Choose x = (z + b)/2. You can check that 
a <z < (z +b)/2 < b, soz < xanda < x< b. Thus, x is a member of T larger than z. A 
similar proof shows that T has no least element. Observe, in particular, that a cannot be 
the least element of T since a does not belong to T. 


8.49. Example. Every nonempty finite set {£1, .. ., £n} C R has a maximum and minimum 
element, as can be proved by induction on n. However, the empty set Ø has no maximum 
or minimum element, since the empty set has no elements in it at all. 
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8.50. Example. The entire set R has no least element and no greatest element. For, given 
any x € R, adding x to the inequalities —1 < 0 < 1 shows that x — 1 < x < x + 1, so there 
exist elements of R greater than x and less than x. The same proof shows that Z has no 
least element and no greatest element. On the other hand, R>o has least element 0, and 
N = Zyo has least element 1, by Lemma 8.41 (a). 


We just saw that the set N of natural numbers has a least element. In fact, N satisfies 
a much stronger condition called the well-ordering property: every nonempty subset of N 
has a least element. We use the following terminology in the proof: for each n € N, define 
[n] = {m E N: m < n}. We proved earlier that every natural number m satisfies 1 < m, 
so we can also write |n] = {m Ee N: 1 < m < n}. It follows from Lemma 8.41(e) that 
[n + 1] = [n] U {n + 1} for all n EN. 


8.51. Theorem on Well-Ordering of N. For all S CN, if S Æ 0, then S has a least 


element. 


Proof. First we prove the following statement by induction on n: for all S C N, if SA [n] 
is nonempty then S has a least element. For the base case, consider n = 1. Fix S C N and 
assume SM [1] is nonempty. Now, [1] = {m E N:1 <m < 1} = {1} by antisymmetry. The 
assumption $M [1] 4 @ forces 1 € S. For any k € S, we know k € N, so 1 < k. Thus, 1 is 
the least element of S in this case. 

For the induction step, fix n € N and assume: for all S CN, if SN [n] Æ Ø then S has a 

least element. We must prove: for all S CN, if S A [n + 1] 49, then S has a least element. 
Fix S CN satisfying SM [n + 1] 4 9. Since [n + 1] = [n] U {n + 1}, there are two cases. 
Case 1. Assume SN [n] 4 Ø. By induction hypothesis, S has a least element. 
Case 2. Assume $1 [n] = Ø, which forces n +1 € S. We claim n + 1 is the least element 
of S. To see this, fix k € S. By trichotomy, we know n +1 < k or k < n + 1. The second 
alternative k < n + 1 is impossible, since it would imply k € S N [n + 1] = {n + 1}, leading 
to the contradiction k = n + 1. Thus, n +1 < k for each k € S, as needed. 

To prove the theorem itself, fix S C N and assume S is nonempty. Then there exists 
a natural number n € S. It follows that S N [n] is nonempty since n € [n]. We can now 
conclude that S' has a least element. 


Least Upper Bounds and Completeness 


The concept of a least upper bound is a kind of substitute for the greatest element when 
the latter does not exist. For example, the open interval (a,b) has least upper bound b. To 
make this precise, we need some additional definitions. 


8.52. Definition: Upper and Lower Bounds. Fix S C R and zo E€ R. 


(a) | zo is an upper bound for S | iff | Yx € S, x < zo. 


b) | S is bounded above | iff | 3z € R, Vx € S, x < 2. 


( 
(c) | zo is a lower bound for S | iff | Yx € S, zo < x. 
( 


d) | S is bounded below | iff | 3z € R, Vx € S, z < x. 


As an example, consider the interval S = [2, 4). Some upper bounds for S are 5, 7, 21/2, 
and 4. In fact, the set of all upper bounds of S is the interval [4, o0), and this set of upper 
bounds has least element 4. We therefore say 4 is the least upper bound (lub) of S and 
write 4 = lub S. Similarly, the set of all lower bounds of S' is the interval (—o0o, 2]. This set 
of lower bounds has greatest element 2, so we say 2 is the greatest lower bound (glb) of S. 
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Here is a related example that motivates the introduction of the Completeness Axiom. 
Consider the set S = {x € Ryo : x? < 2}. Intuitively, for positive real x, the condition 
x? < 2 is equivalent to x < V2, so the set of upper bounds for S ought to be the interval 
[V2,00). (This is only an intuitive argument for now, since we have not yet defined or 
proved the existence of v2.) Thus lub § should exist and be equal to v2. On the other hand, 
suppose we restrict ourselves to using rational numbers only. Intuitively, the restricted set 
S' = {x € Q>o : 2? < 2} has set of upper bounds [V2, 00) NQ. This set of upper bounds has 
no least element since V2 is not rational, but there are rational numbers arbitrarily close 
to v2 and larger than v2. Thus lub S’ does not exist in the ordered set Q, even though S’ 
is bounded above. This example reveals that the number system Q has a “hole” in it where 
V2 should be. The Completeness Axiom for R says, intuitively, that R “has no holes” in the 
sense that least upper bounds exist in every case where this is possible. We now officially 
define lub S and glb S and state the Completeness Axiom. 


8.53. Definition: Least Upper Bounds. Fix S C R and z € R. Define 
zo = lub S| iff zo is an upper bound of S and for every upper bound y of S, zo < y. 


Some texts denote lub S by sup S (the supremum of S). 


8.54. Definition: Greatest Lower Bounds. Fix S C R and zp € R. Define 
zo = glb S| iff zo is a lower bound of S and for every lower bound y of S, y < zo. 


Some texts denote glb S by inf S (the infimum of S). 
Expanding the definition of a least upper bound, we see that 


z=lubS iff [Nze S,x < zo and Vy E R, (Yz € S,x < y) > z < yl. 


Taking the contrapositive of the final implication and using trichotomy, we arrive at the 
following phrasing of the definition that is often more convenient in proofs: 


z=lubS| & |[MreE sS, x< zo] A [Ny ER,y < z > Jr E€ S,y < zx]! 


In words, zo is the least upper bound of S$ iff zo is an upper bound of S and each y < zo is 
not an upper bound of S. Similarly, 


z=glbS| < |[Mz E€ S, zo < z] A [Vy € R, zo < y > Jx € S,x < y] |. 


Note that lub S' is the minimum element of the set of upper bounds of S, and glbS is the 
maximum element of the set of lower bounds of S. Thus, lub S and glb S are unique when 
they exist. The Completeness Axiom postulates a basic sufficient condition for the existence 
of lub S. 


8.55. Completeness Axiom (O7). Every nonempty subset of R that is bounded above 
has a least upper bound in R. 


As a first illustration of this axiom, we use it to derive the analogous sufficient condition 
for the existence of greatest lower bounds. 


8.56. Theorem on Greatest Lower Bounds. For all S C R, if S is nonempty and 
bounded below, then S has a greatest lower bound in R. 


Proof. Fix S C R that is nonempty and bounded below by xo. [The main idea of the proof 
is to use negation to flip the ordering, turning lower bounds into upper bounds and least 
upper bounds into greatest lower bounds.] Define a new set T = {—y : y € S}, which is 
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nonempty since S is nonempty. For all y € S, we know zo < y, so —y < —xo. This means 
that —29 is an upper bound for the set T. By the Completeness Axiom (O7), zọ = lub T 
exists in R. We show that —zọ = glb S. On one hand, for fixed y € S, we know —y E T, 
so —y < zo and hence —zo < y; this proves that —zo is a lower bound for the set S. On 
the other hand, consider a fixed real wo > —29. Then —wo < 29, so —Wo is not an upper 
bound for T; so there exists t € T with —wo < t. Writing t = —y for some y € S, we have 
—wo < —y, SO y < Wo, SO Wo is not a lower bound for S. This means that —zp is the greatest 
lower bound for S, as needed. 


The Archimedean Ordering Property 


We often visualize R as a number line extending infinitely far to the left and right. Each 
real number x corresponds to exactly one point on this line, and the order relation £ < y 
means that the point for x is located to the left of the point for y. Our intuition tells us 
that no matter how far to the right we go on this line, there will always be positive integers 
even farther to the right. The next theorem uses the Completeness Axiom to justify this 
pictorial intuition. 


8.57. Theorem on Archimedean Property of R. 


(a) Va € R, In € Zso,a <n. 
(b) Vy € Rso, dn E Zso,1/n < y. 


Proof. We prove (a) by contradiction. Assume, to get a contradiction, that Jx € R, Yn € 
Zso,n < x. This assumption is precisely the statement that the set Z>o is bounded above. 
By the completeness axiom for R, the set Z>o has a least upper bound, say zo = lub Zyo. 
Now zo— 1 < xg, so £o — 1 is not an upper bound for Z»o. Thus there must exist a positive 
integer n with £o — 1 < n. Adding 1 to both sides, we get zo < n + 1, where n + 1 is 
also a positive integer. Since zo is an upper bound for Zo, we have n + 1 < xo. But now 
zo < n +1 < 2 contradicts trichotomy. This proves (a). 

To prove (b), fix y € Ryo. Then 1/y is a positive real number, and by (a) there exists a 
positive integer n with 1/y < n. By Theorem 8.40(a), we conclude that 1/n < 1/(1/y) = y, 
as needed. 


Ordering Properties of Z 


The Completeness Axiom tells us that any subset S C R that is nonempty and bounded 
above has a least upper bound. If S happens to be a subset of Z, we can show that a 
stronger result holds: S actually has a maximum element. Similarly, if S C Z is nonempty 
and bounded below, then S has a minimum element. To prove these statements, we reduce 
to the well-ordering property of N established earlier. 


8.58. Theorem on Strong Completeness of Z. 

(a) For all S C Z, if S is nonempty and S' is bounded below (in R) then S has a least 
element. (b) For all S C Z, if S is nonempty and S is bounded above (in R) then S has a 
greatest element. 


Proof. For (a), fix S C Z and assume S is nonempty and bounded below. Let x € R be a 
lower bound for S. By the Archimedean Property, there is an integer N with —x < N, so 
—N < zx. [The main idea of this proof is to translate S C Z to the right N units to get a 
subset of N, thus reducing the result in (a) to the Well-Ordering Principle.] Consider the set 
T ={m+N : m € S} obtained from S$ by adding N to each element. Given any element 
k=m+N€ET with m € S, note that -N < x < m, so 0< m+N = k, so k € Zo =N. 
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So T is a subset of N, and T is nonempty since S' is nonempty. Since N is well-ordered, T 
has a least element ky = Mmo + N, where mg € S. For any m € S, we have m+ N € T. The 
fact that mo + N = ko < m+ N implies mp < m. Thus, mo is the least element of S. 

For (b), fix S C Z and assume S$ is nonempty and bounded above. Let x € R be an 
upper bound for S. [Here the idea is to flip the order by negating everything, as in the 
proof of Theorem 8.56.] Define T = {-m: m € S}. Since m < x for all m € S, we have 
—ax < —m for all —m € T. Thus, — is a lower bound for T. Also T is nonempty since S is 
nonempty. By (a), T has a smallest element, say —mp with mp € S. Given any m € S, we 
know —mg < —m, and hence m < mo. Thus, mo is the largest element of S. 


When we draw a number line to represent R, we often make tick marks, spaced one unit 
apart, to represent the integers. Based on such a picture, it is highly believable that for 
every real x, the half-open interval |z, + 1) must contain exactly one integer. The next 
theorem justifies this pictorial intuition. 


8.59. Theorem on Distribution of Z in R. |Yx E€ R, J3!n EZ, z <n<zr+l1. 


Proof. Fix x € R. To prove existence of n, let S = {m € Z : x < m}. By the Archimedean 
Property, S is nonempty. Also x is a lower bound for S, so we know S has a least element n. 
To get a contradiction, assume that x +1 < n. Then x < n—1 where n— 1 is an integer (by 
closure of Z under subtraction). But this means n — 1 € S; since n— 1 < n, this contradicts 
the choice of n as the least element of S. Hence, n < x + 1 must hold. We also know z < n 
since n € S. 

To prove uniqueness of n, suppose (to get a contradiction) that nı and nz are distinct 
integers that both belong to the interval [x, x +1). We can choose notation so that nı < ne, 
and hence 0 < n2 — nı. On the other hand, —nı < —x and ng < x + 1 combine to give 
n2 — nı < (x +1)— x = 1. Now na — nı is an integer in the open interval (0,1), which 
contradicts Lemma 8.41(d). So the n found above is unique. 


Density of Q in R; Real Square Roots 


The next result says that Q is dense in R, meaning that arbitrarily small open intervals 
contain rational numbers. 


8.60. Theorem on Density of Q in R. |Va,b E€ R,a < b= J3qEQ,a<q<b. 


Proof. Fix a,b € R with a < b. Since 0 < b — a, we can use the Archimedean property to 
find a positive integer n with 1/n < b — a. Adding a — (1/n) to both sides, we find that 
a < b—(1/n); and b — (1/n) < b since 0 < 1/n. [The key idea is to magnify the interval 
[b—1/n,b) by a factor of n to get the interval [nb— 1, nb) of length 1.] Applying Theorem 8.59 
to the real number x = nb — 1, we obtain a (unique) integer m with nb— 1 < m < nb. 
Multiplying these inequalities by the positive quantity 1/n, we get b — (1/n) < m/n < b. 
Since a < b — (1/n), transitivity gives a < m/n < b. As m and n are integers, q = m/n is a 
rational number in (a, b). 0O 


The set of irrational numbers is also dense in R, but to prove this we must first know 
that irrational numbers actually exist! We now use the Completeness Axiom to show that 
all positive real numbers have real square roots. Combining this with the earlier theorem 
that 2 has no rational square root, we deduce the existence of real numbers that are not 
rational. We may also conclude that Q cannot satisfy the Completeness Axiom (otherwise, 
the proof given below would apply to the number system Q and prove that 2 had a square 
root in Q, which is not true). 
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8.61. Theorem on Existence of Real Square Roots. | Va € Ryo, dx € Ryo, x? = a. 


Proof. The proof uses the following fact, which is left as an exercise: for all y,z € Rso, 
y < z iff y2 < 27. We first prove the result for a fixed a € R> 1. Since 1 < a and a is positive, 
we see that a < a?. [To use the Completeness Axiom to solve x? = a, we need a set of real 
numbers with least upper bound z.] Define the set S = {z € Rso : z? < a}. On one hand, 
S is nonempty because 0? = 0 < a, so that 0 € S. On the other hand, we claim that S is 
bounded above with upper bound a. For given an arbitrary z € S, we know 2? < a < a?, 
so z <a by the initial fact. 

We have now verified the hypotheses of the Completeness Axiom (O7), so we can con- 
clude that x = lub S exists in R. Since 0 € S, 0 < x. By trichotomy, x? < a or a < 2? or 
x? =a. The third case is the required result; we finish the proof by showing that the first 
two cases lead to contradictions. 

First assume that x? < a. On one hand, this means x € S, so x < a as shown above. 
On the other hand, for any n € N, we know 1 < n, so n < n? and 1/n? < 1/n. Now 
calculate (2 + 1/n)? = x? + (2/n)a + (1/n?) < x? + (2/n)a+ (1/n) < x? + (2a + 1)/n. By 
the Archimedean Property applied to the positive number (a — xz?) /(2a + 1), we can choose 
a positive integer n with 1/n < (a — x°)/(2a + 1). Then z < x +1/n and (x + 1/n)? < 
x? + (2a + 1)/n < x? + (a — x?) = a. So x + 1/n belongs to S and is larger than x; this 
contradicts the fact that x is an upper bound for the set S. 

Now assume that a < x°. This time, we calculate (x — 1/n)? = x? — (2/n)x + (1/n?) > 
x? — (2x/n). By the Archimedean Property, there exists a positive integer n with 1/n < x 
and 1/n < (x? — a)/(2x), so x — (1/n) > 0 and (x — 1/n)? > x? — (2x/n) > a. Given any 
y € S, note that y? < a < (x —1/n)?, so y < x —1/n by the initial fact. But this means 
x —1/n is an upper bound for S, contradicting the choice of x as the least upper bound for 
the set S. 

Finally, we prove the result for fixed a between 0 and 1. Since 1/a > 1, we have just 
shown that there exists y € Ryo with y? = 1/a. Then x = 1/y satisfies x? = 1/y? = a, as 
needed. 


Nested Interval Property 


As a final illustration of the Completeness Axiom, we prove a theorem stating that the 
intersection of any sequence of nested closed intervals is always nonempty. This is another 
way of formulating the intuitive notion that the real number system R has no holes. 


8.62. Nested Interval Theorem. Suppose {[a,,, bn] : n € Zso} is a collection of nonempty 
closed intervals such that [an+1, bn+1] C [an bn] for all n € Z>o. Then NZ lan, bn] #0. 


Proof. For each n € Zso, we have an < bn since [an, bp] is nonempty, an < an+ı since 
An+1 E lan+1; bn+1] C lan, bn], and bn+1 < bn since bn+1 & lan+1; bn+1] C lan, bn]. By 
induction, we readily deduce the following consequences of these inequalities: for all positive 
integers i < j, a; < aj and bj < bi; and for all positive integers k,n, ak < bn. 

Now consider the set S = {an : n € Z>o}, which is a nonempty subset of R. This set 
is bounded above by bı, since any a, € S satisfies an < bı. By the Completeness Axiom 
(O7), there exists r € R with r = lub S. We claim r € NX; [an, bn], which proves that this 
intersection is nonempty. Fix n € Zo; we must show r € [an, bn], which means a, < r and 
r < bn. Since r is an upper bound for S, a, < r is true. Assume, to get a contradiction, 
that r < bn is false. Then bp < r = lub S, so bn is not an upper bound for S. So there exists 
ak E S with bn < ak, contradicting the previous paragraph. Thus, r < bn is true. 


One consequence of the Nested Interval Theorem is that for any sequence (zn : n € Zso) 
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and any interval [a,b], there is a point in [a,b] different from all z,,; in other words, [a,b] 
is an uncountable set (see $7.4). The Nested Interval Theorem can also be used to prove 
the Intermediate Value Theorem for continuous real-valued functions. Given a continuous 
f : [a,b] > R and c € R between f(a) and f(b), the idea is to find x € R solving f(x) = c 
by repeatedly bisecting the interval [a,b] and taking x in the intersection of the resulting 
sequence of nested closed intervals. 


Section Summary 


1. Definitions. For S C R and zp € R, Zo is an upper bound of S iff Vx € S,x < 29; 
zo = max S iff zọ € S and zo is an upper bound of S; zo = lub S (the least upper 
bound of S) iff zo is an upper bound of S and every y < zo is not an upper 
bound of S. Lower bounds, min S, and the greatest lower bound glb § are defined 
similarly. 

2. Completeness and Related Properties. The Completeness Axiom states that every 
nonempty subset of R that is bounded above has a least upper bound. Conse- 
quently, every nonempty subset of R that is bounded below has a greatest lower 
bound. A nonempty subset of Z that is bounded above (resp. below) has a great- 
est (resp. least) element. Every nonempty finite subset of R has a greatest and a 
least element. For a general subset S of R, max S, min S, lub S, and glb S need 
not exist; but each of these numbers is unique when it exists. 

3. Archimedean Property. For each x € R, there is a positive integer n with x < n. 
For each y € Ryo, there is a positive integer n with 1/n < y. 

4. Distribution of Z and Q in R. For each x € R, there is exactly one integer n 
such that x < n < x + 1. For each a < b in R, there is a rational number q with 
a<q<b. 

5. Existence of Real Square Roots. For each a € Ryo, there exists x € Ryo with z? = 
a. This statement does not hold for Qo, so Q does not satisfy the Completeness 
Axiom. 

6. Nested Interval Property. Any collection of nonempty closed intervals |an, bn] 
that are nested (i-e., [@n41,bn+1] C [an, bn] for all n € Z>0) must have nonempty 
intersection in R. 


Exercises 


1. (a) Prove: for all y,z € Ryo, y < z iff y? < z?. (b) Does either implication in (a) 
hold for all y, z € R? 

2. (a) Prove: for all x,y € R, if x < y, then z < (x + y)/2 < y. (b) More generally, 
prove Yx, y,t € R, if x < y and 0 < t< 1, then z < tz + (1 — t)y < y. (c) Prove: 
for all a,b € R, if a < b, then the open interval (a,b) has no least element. 


3. Prove that R<o has no maximum and no minimum. 


4. Let S = {3 — 1/n : n € Zyo}. Find (with proof) lub S and glb S. Does max S 
exist? Does min S' exist? 

5. (a) Prove: For all S C R, if S is bounded above and bounded below, then glb S < 
lub S. (b) Find all sets S C R such that glb S and lub S exist and are equal. 


6. Recall that [n] = {1,2,...,n} = {m € Zso : m < n}. Prove: for all n € Z>o 
and all functions f : [n] > R, the image of f has a greatest element and a least 
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10. 
11. 
12. 


13. 
14. 


15. 


16. 
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element. Deduce that every nonempty finite subset of R has a maximum and a 
minimum element. 


(a) If S is not bounded above, what is the set U of all upper bounds of S? Does 
U have a least element? Does lub S exist? (b) Repeat part (a) for S = Ø (the 
empty set). 


Prove: Vy € Rso, Ve € Ryo, dn E Zso, ne > y. 

(a) Where is the flaw in this proposed proof of Theorem 8.57(a)? “Fix ao in 
R. Choose n = zo + 1. Since we know 0 < 1, adding zo to both sides gives 
zo < zo +1 = n, as needed.” (b) Where is the flaw in this proposed proof of 
Theorem 8.60? “Fix a,b € R with a < b. Choose q = (a+ b)/2. Since a < b, an 
earlier exercise gives a < q < b, as needed.” 


Prove: for all x € R, there exists a unique n € Z witha <n<a+l1. 


Give a careful proof that {[n,n + 1): n € Z} is a set partition of R. 


Prove: for all a,b € R, if a < b, then there exists x E€ R— Q with a < x < b. [Hint: 
We know V2 € R—Q, and we know there is a rational number in (a+ V2, b+ V2).] 
Prove that every a € Ryo has a unique positive square root. 


Let (X, <) be any partially ordered set satisfying the Completeness Axiom (07). 
Prove that for all S C X, if S is nonempty and bounded below, then S$ has a 
greatest lowest bound. (This gives another proof of Theorem 8.56 not relying on 
the algebraic structure of R.) 


For x = (z1,..., £n) and y = (y1,.--,Yn) in R”, define 


d(x,y) = max{|xı = yıl, EEE. |En = Ynl}- 


Prove that this distance function satisfies the metric space axioms in Theo- 
rem 8.46. 


For £ = (a1,...,2n) and y = (y1,.--, Yn) in R”, define d(x, y) = Xz] |ve — ysl: 
Prove that this distance function satisfies the metric space axioms in Theo- 
rem 8.46. 


Suggestions for Further Reading 


In this text, we have covered fundamental concepts of logic, proofs, set theory, integers, 
relations, functions, cardinality, and real numbers that are absolutely essential for further 
pursuit of advanced mathematics. Each of the subjects we have studied is a vast subdis- 
cipline of mathematics that goes far beyond the introductory material discussed here. We 
now suggest some books that interested readers may consult to learn more about these 
fascinating topics. 


Logic and Set Theory 


Mathematical logic is a beautiful and amazing subject that applies the precise tools of 
mathematical reasoning to explore the notions of proof, truth, and computability. 


Logic for Mathematicians by A. G. Hamilton (Cambridge University Press, 2001) is a 
very readable introduction to this field. 


Godel, Escher, Bach: An Eternal Golden Braid by Douglas Hofstadter (Basic Books, 
1999) is aremarkable, Pulitzer Prize-winning book that weaves together logic, art, music, 
mathematics, molecular biology, and much more. The book is written for a popular audi- 
ence but is loaded with technical content including multiple explanations and metaphors 
for Gédel’s celebrated Incompleteness Theorem. 


Mathematical Logic by Joseph R. Shoenfield (Association for Symbolic Logic, 1967) is a 
more advanced and challenging logic textbook, which is extremely carefully written. 


Axiomatic set theory rigorously develops properties of sets beginning with an explicit 


list of axioms. We sketched the beginnings of this subject in the optional Section 3.7. 


Naive Set Theory by Paul Halmos (Springer, 1974) is a famous 100-page book by one of 
the greatest mathematical writers. Despite the title, this book really is about axiomatic 
set theory, although Halmos focuses on those aspects of set theory most relevant to other 
parts of mathematics. 


Introduction to Set Theory (third edition) by Karel Hrbacek and Thomas Jech (CRC 
Press, 1999) is a solid exposition of set theory appropriate for advanced undergraduates. 
One attractive feature of this book is that each new axiom of set theory is carefully 
motivated before it is introduced, giving readers intuition for why these particular axioms 
were chosen. 


Introduction to Set Theory by J. Donald Monk (McGraw-Hill, 1969) is an advanced but 
very well-written development of a version of set theory in which proper classes (such 
as the class of all sets) are allowed. After summarizing basic material on sets, relations, 
and functions (similar to what we covered in this text), the book provides an outstanding 
treatment of ordinal numbers, the axiom of choice, and cardinal numbers. 
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Analysis 


Analysis is the branch of mathematics that rigorously develops the calculus of real-valued 
functions and their generalizations. This subject begins with the ordered field axioms for 
real numbers, which are covered in Chapter 8 of this text. 


e Advanced Calculus (second edition) by Patrick Fitzpatrick (American Mathematical Soci- 
ety, 2006) gives a very readable account of single-variable and multivariable real analysis. 


e The Real Numbers and Real Analysis by Ethan Bloch (Springer, 2011) is another nice 
advanced calculus textbook that starts with axiomatic developments of various number 
systems (integers, rational numbers, and real numbers). 


e Analysis on Manifolds by James Munkres (Westview Press, 1991) is this author’s all-time 
favorite mathematics book. It gives a wonderful treatment of multivariable calculus topics 
culminating in chapters on manifolds in R”, differential forms, and the Generalized Stokes 
Theorem. 


e Real Analysis by Haaser and Sullivan (Dover, 1991) is an advanced textbook on abstract 
analysis. This book lucidly describes many topics including the formal construction of R 
from Q via Cauchy sequences, the theory of metric spaces, a development of the Lebesgue 
integral via the Daniell approach, Banach spaces, and Hilbert spaces. 


Number Theory and Algebra 


Number theory studies properties of the natural numbers. Our discussion of divisibility, 
primes, unique prime factorizations of integers, and modular arithmetic barely scratch the 
surface of this ancient subject, which has many unexpected practical applications to modern 


cryptography. 


e The Higher Arithmetic (eighth edition) by Harold Davenport (Cambridge University 
Press, 2008) is a wonderful introduction to number theory with short chapters on many 
of the key topics in this area. 


e A Guide to Elementary Number Theory by Underwood Dudley (MAA Press, 2009) is 
a fantastically clear exposition of the basic theorems of number theory that is only 140 
pages long. 


e Elementary Number Theory and its Applications (sixth edition) by Kenneth Rosen (Pear- 
son, 2010) is a thorough, comprehensive undergraduate textbook on number theory. 


Abstract algebra starts with familiar properties of integers and real numbers (like as- 
sociativity) and considers these properties in far more general settings. What results is an 
elegant abstract theory that applies to any structure satisfying the initial axioms. 


e Ideals, Varieties, and Algorithms (fourth edition) by Cox, Little, and O’Shea (Springer, 
2015) is an amazing introduction to computational algebra that focuses on the algebra of 
multivariable polynomials. The book perfectly blends theory and computation. 


e A First Course in Abstract Algebra (third edition) by Joseph Rotman (Pearson, 2005) is 
a nice introduction to groups, rings, and fields by one of the best writers on this subject. 
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(a,b), see ordered pair or open interval 
(a, b,c), see ordered triple 
2,3,...,10 defined, 350 

A — B, see set difference 

AN B, see intersection 

AUB, see union 

AC B, see subset 

A D B, see superset 

A x B, see product set 

AAB, see symmetric difference 
Ix, see identity relation 

P= Q, see if 

P@Q, see exclusive-OR 

PV Q, see or 

PAQ, see and 

R{C], see image of set under relation 
R-t, see inverse of a relation 
Y~ (set of functions), 338 

C, see complex numbers 

N, see natural numbers 

Q, see rational numbers 

R, see real numbers 

R?, 134 

Z, see integers 

Z/n (integers mod n), 296, 308 
Z x Z is countably infinite, 324 
Nicer Ai, see general intersection 
Ua, see formal union 

User Ai, see general union 

Xa, see characteristic function 
6, see delta 

Ü, see empty set 

=n (congruence mod n), 284 
x, see existential quantifier 
lx, see uniqueness symbol 
Va, see universal quantifier 

glb S, see greatest lower bound 
glue(f; : i € I), see gluing 

Ida, see identity function 
€-transitive set, 131 

inf S, see greatest lower bound 
<lex; 307 

lub S, see least upper bound 


max S, 303 

min S, 303 

~P, see not 

ĮI}; ak (product notation), 160 
P(A), see power set 

sgn(x), see sign function 


~p (equivalence relation induced by P), 
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~y (equivalence relation induced by f), 283 


X; ak (summation notation), 160 
sup S, see least upper bound 

|A| = |B| (cardinal equality), 322 
|A| < |B| (cardinal ordering), 322 
|X| =n, 315 

|x|, see absolute value 

{x : P(x)}, see set-builder notation 
a/b (formal construction), 311 

a|b (a divides b), 185 

f[A], see direct image 

ft, see inverse function 

f—[B], see preimage 

fls, see restriction of a function 

g : A > B (function notation), 231, 233 
go f, see function composition 
ja,B, see inclusion function 

n! (factorial), 159 

nth root, 200 


absolute adjective, 43 
absolute value, 78, 371 
absorption law, 9, 120 
abstract algebra, 384 
addition, 240, 349 
axioms, 350 
cancellation law, 352 
in Q, 311 
inverse, 240, 349 
mod n, 308 
set closed under, 338 
table, 355 
additivity of sums, 164 
advice for set proofs, 123 
algebraic properties of order, 369 
algebraic structure, 351 
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386 


algorithm for gcds, 186 
all 
definition of, 27 
inference rule, 50 
proofs via generic elements, 82 
analysis, 349, 384 
and 
definition of, 3 
proof template for, 64 
antecedent, 6 
antisymmetry, 301 
of cardinal ordering, 336 
Archimedean Property of R, 378 
arcsine, 270 
arithmetic mean, 51 
arithmetical fact, 353 
arrow diagram, 211, 246, 260, 339 
and inverses, 219 
of restriction, 248 
associativity, 9, 49, 52, 242, 247 
generalized, 354 
of composition of relations, 226 
axiom, 48, 51 
definitional, 49 
for Z, 49 
for addition in R, 350 
for groups, 52 
for multiplication in R, 350 
for positive real numbers, 374 
of ZFC set theory, 156 
Axiom of Abstraction, 149 
Axiom of Choice, 154 
Axiom of Completeness, 351, 377 
Axiom of Extension, 113, 150 
Axiom of Foundation, 129, 155 
Axiom of Induction, 160 
Axiom of Infinity, 153 
Axiom of Pairs, 152 
Axiom of Power Sets, 152 
Axiom of Regularity, 156 
Axiom of Replacement, 154 
Axiom of Specification, 151 
Axiom of the Empty Set, 153 
Axiom of Unions, 152 
axiomatic set theory, 149, 383 


backward direction, 65 
backwards induction, 169 
base b expansion, 184 
base case, 161 
biconditional, 6 


bijection, 260, 261 
and inverses, 269 
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between equivalence relations and set 


partitions, 297 

composition of, 264 
bijective function, 261 
binary union, 152 
bisquare, 62 
bit, 13 
block of a set partition, 295 
bounded above, 351, 376 
bounded below, 376 


cancellation law, 52, 352 
Cantor’s diagonal argument, 333, 334 
Cantor’s Theorem, 334 
cardinal equality, 322 
cardinal ordering, 322 
cardinality, 315 
definition review, 341, 344 
of power set, 334 
Cartesian graph, 339 
cases, 76 
chain in a poset, 303 
chain proof, 121 
characteristic function, 243 
properties, 243 
characterization 
of divisors, 197 
of subsets, 120 
Choice Axiom, 154 
circle proof, 119 
closed interval, 132 
is uncountable, 334, 381 
closed set, 148 
closure, 49, 52, 350 
and irrational numbers, 79 
of R0, 356 
of integers, 364 
of natural numbers, 363 
of rational numbers, 365 
Closure Theorem for Relations, 226 
codomain, 231, 233 
enlarging, 238 
commutative group, 351 
commutative ring, 351 
commutativity, 9, 49, 242 
generalized, 354 
complete induction, 173 
Completeness Axiom, 351, 377 
completeness of Z, 378 
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Completeness Theorem, 52 rules, 97 
complex numbers, 28 rules for inequalities, 368 
formal construction of, 374 density of Q in R, 379 
component of ordered pair, 132 diagonal argument, 333, 334 
composite number, 179 difference, 349 
composition digraph, 339 
and inverses, 268 digraph of a relation, 275 
associativity, 226 direct image, 254 
inverse of, 227 properties, 255 
of functions, 246 direct implication, 65 
of injections, etc., 264 direct proof of IF-statement, 58 
of relations, 221 disjoint sets, 115, 316 
properties, 247 disjunction, 6 
concatenation poset, 303 disproof, 79 
conclusion, 6, 10 disproving a false statement, 79 
conditional, 6 distance 
congruence modulo n, 284 in R”, 373 
conjunction, 6 in R, 372 
consequent, 6 distribution of Z in R, 379 
consistent theory, 52 distributive law, 9, 49, 242, 247 
constant, 27 for relations, 227 
generic vs. specific, 83 divides, 47, 185 
constant function, 239 division, 180, 349 
continuity, 88 for negative integers, 181 
contradiction, 21 properties, 358 
contradiction proof, 70 divisor, 185 
contrapositive, 12 and prime factorization, 197 
inference rule for IF, 50 domain, 231, 233 
contrapositive proof, 63 double induction, 366 
converse, 12 Double Inverse Rule, 226 
converse error, 50 dyadic rational, 62 
converse implication, 65 
conversion rules for quantifiers, 32 EI, see existential instantiation 
countability of Q, 325 eight, 350 
countability of Z x Z, 324 either, 19 
countability of finite products, 325 elementary row operation, 188 
countable set, 328 Elements (Euclid’s textbook), 185 
product of, 329 eliminating the uniqueness symbol, 43 
union of, 330 ellipsis, 159 
countably infinite set, 323 empty relation, 213 
cube sum, 164 empty set, 106, 153 
proving a set is empty, 115 
de Morgan’s laws, 10 Empty Set Axiom, 153 
decimal expansion, 184 enlarging the codomain, 238, 242 
defined term, 47 Epimenides’s Paradox, 1 
definition of ordered pair, 136 equality 
definition text, 48 of functions, 241, 339 
definitional axiom, 49 of sets, 113 
delta (Greek letter), 85 proof via chain proof, 121 
denial, 6, 35 equality relation, 282 


practice, 40 equivalence class, 288, 342 
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criterion for equality, 291 formal union, 152 
disjointness property, 291 formula, 51 
equivalence relation, 282, 342 forward direction, 65 
and set partitions, 298, 345 Foundation Axiom, 155 
equivalence classes of, 288 four, 350 
induced by a function, 283 fraction, 311 
induced by a set partition, 296 in lowest terms, 195, 313 
intersection lemma, 285 rules for computing, 359 
Euclid, 185 function 
gcd algorithm, 186 arithmetic operations, 240 
lemma on primes, 193 associated equivalence relation, 283 
Theorem on Primes, 179 bijective, 261 
Euclidean distance function, 373 characteristic, 243 
even, 47 composition, 246 
function, 245 constant, 239 
integer, 169 definition review, 340 
every integer is even or odd, 169, 182 direct image, 254 
exclusive-OR, 3 equality, 241, 339 
definition of, 3 even, 245 
English translation, 5 formal definition, 233 
proof template, 72 gluing, 250 
properties, 21 graph of, 232, 233 
exhaustion, 82 identity, 239 
existence of square roots in R, 379 inclusion, 239 
existential instantiation, 59, 92 informal definition, 231 
existential quantifier, 27 injective, 261 
proof template, 57 inverse, 268, 342 
exponent level set, 258 
laws of, 168, 170 linear, 245 
negative, 171 odd, 245 
notation, 160 one-to-one, 261 
Extension Axiom, 150 onto, 261 
piecewise, 251 
F (false), 2 pointwise operations, 240 
factorial, 159 polynomials, Al 
factorization preimage, 254 
of a product, 197 proving equality, 241 
uniqueness of, 193 restriction, 248 
favorite mathematics text, 384 set of, 338 
fiber of a function, 289 surjective, 261 
Fibonacci number, 191 theorem review, 343 
Fibonacci sequence, 173, 177 well-defined, 235, 339 
closed formula, 178 Fundamental Theorem of Arithmetic, 
field, 351 193 
integers mod a prime, 367 further reading, 383 
is an integral domain, 357 
finite set, 315 Gaussian elimination, 188 
five, 350 gcd algorithm, 186 
FOIL Rule, 16, 357 general intersection, 141 
formal construction of Z from Zso, 314 formal construction, 154 


formal language, 51 properties, 142 


Index 


general union, 141 
formal construction, 154 
properties, 142 
generalized associativity, 354 
generalized commutativity, 354 
generic element, 82 
geometric mean, 51 
gluable family, 250 
gluing, 250 
Goldbach’s Conjecture, 34 
graph 
Cartesian, 339 
of a function, 232, 233 
of a relation, 212 
of inverse relation, 220 
of restriction, 248 
greatest common divisor, 185 
as linear combination, 192 
matrix reduction algorithm, 188 
prime factorization of, 198 
greatest element, 303, 375 
greatest lower bound, 377 
existence of, 377 
group, 351 
axioms, 52 
group axioms, 52 


Haase diagram, 302 
half-open interval, 132 
hidden quantifier, 33 
Horizontal Line Test, 262 
hypothesis, 6, 10 


idempotent law, 9 
identity, 49, 52, 242, 247 
identity function, 239 
identity relation, 220 
if 
contrapositive proof, 63 
definition, 10 
direct proof, 58 
inference rule, 50 
logical properties, 12 
iff (if and only if), 20 
English translations, 20 
proof template, 65 
proof via chain proof, 121 
image 
of set under relation, 213 
properties of, 215 
implication, 6 
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inclusion function, 239 
inclusive-OR, 3 
incomparable elements, 302 
Incompleteness Theorem, 383 
inconsistent theory, 52 
index, 141 
index set, 141 
indexed intersection, 141 
indexed union, 141 
induction 
backwards, 169 
double induction, 366 
formulation using sets, 363 
proof template, 161 
starting anywhere, 166 
stepping by 2, 171 
strong induction, 173 
Induction Axiom, 160, 345 
proof from axioms for R, 363 
induction hypothesis, 161 
induction step, 161 
inductive set, 362 
inequality denial rules, 368 
inference rule, 50, 51 
Inference Rule for ALL, 50, 92 
Inference Rule for IF, 50 
infimum of a set, 377 
infinite interval, 132 
infinite sequence, 332 
infinite set, 315 
infinitely many primes, 179 
Infinity Axiom, 153 
infix notation, 275 
informal set theory, 103 
initial condition, 159 
injection, 260, 261 
composition of, 264 
injective function, 261 
input, 211, 231 
instance, 48, 59, 70 
integer division, 180 
Integer Division Theorem, 180, 182 
integers, 28 
and natural numbers, 371 
are even or odd, 182 
axioms for, 49 
closure properties, 364 
completeness properties, 378 
definition as subset of R, 364 
distribution within R, 379 
form countably infinite set, 323 
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form infinite set, 321 
formal construction from Z>9, 314 
mod n, 290, 296, 308, 367 
review, 203 
integral domain, 357 
Intermediate Value Theorem, 381 
interpretation, 51 
intersection, 104, 105 
formal construction, 151 
of equivalence relations, 285 
of indexed collection of sets, 141 
intersection (| A, 157 
interval, 132 
inverse, 49, 52, 242, 349 
in Zp, p prime, 367 
of a composition, 227 
of a relation, 219 
of functions, 342 
inverse function, 268 
and bijections, 269 
and composition, 268 
left inverse, 270 
properties, 270 
right inverse, 271 
uniqueness, 269 
inverse image, 254 
inverse of P > Q, 16 
invertible function, 268 
IRA, see Inference Rule for ALL 
irrational numbers, 79 
irrationality of V2, TA 
irrationality of roots, 200 
irreflexivity, 307 


knowledge template for subsets, 111 


law of exponents, 168, 170 

least common multiple (lem), 198 

least element, 165, 303, 375 

least upper bound, 351, 377 

Left Cancellation Law, 52, 274, 352 

left inverse, 270 

Lemma on N and Zyo, 371 

Lemma on Divisibility by Primes, 
193 

Lemma on Subsets of Z»o, 328 

Lemma on the Minimum, 198 

level set, 258 

lexicographic ordering, 307 

linear combination 


and greatest common divisors, 192 
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of integers, 187 
linear function, 245 
linearity, 252 
linearity of sums, 164 
logical axiom, 351 
logical connective, 2 
logical equivalence, 9 
logical symbols, 97 
logically equivalent forms, 5 
lower bound, 376 
lowest terms for a fraction, 195 


mathematical logic, 383 
matrix 
elementary row operation, 188 
reduction algorithm for gcds, 188 
maximum, 198 
maximum element of a set, 375 
meaning, 51 
member of a set, 103 
meta-truth table, 14 
metric space, 373 
minimum, 198 
minimum element of a set, 375 
mixed quantifiers, 84 
model, 52 
modus ponens, 51 
modus tollens, 51 
monotonicity 
direct images, 255 
image of relations, 215 
of product set operations, 134 
of set operations, 106 
of unions and intersections, 142 
preimages, 256 
relation operations, 227 
multiple, 185 
multiple quantifiers, 83 
multiplication, 240, 349 
axioms, 350 
by zero, 356 
in Q, 311 
inverse, 240, 349 
inverse in Zp, 367 
mod n, 308 
properties, 358 
set closed under, 338 


natural numbers, 362 
closure properties, 363 
compared to positive integers, 371 


Index 


definition as subset of R, 362 
well-ordering of, 376 
necessary, 18 
negation, 6, 35 
of quantifier, 34 
rules for, 9 
negative, 349, 370 
negative exponent, 171 
Nested Interval Theorem, 334, 380 
nine, 350 
no set of all sets, 151, 336 
No-Universe Theorem, 151, 336 
non-gradable adjective, 43 
not 
definition of, 2 
proof template, 79 
notation for number systems, 28 
number systems, 28 
number theory, 384 
review, 203 
numerals, 350 


odd, 47 
every integer is even or odd, 182 
function, 245 
integer, 169 
sum of odd integers, 162 
one, 349 
one-to-one function, 261 
only if, 18 
onto function, 261 
open disk, 148 
open interval, 132 
open sentence, 27 
open set, 148 
or 
definition of, 3 
proof template, 72 
using known OR-statement, 76 
order properties for R, 368 
ordered field, 351 
and Q, 365 
ordered pair, 132 
definition, 136 
formal construction, 152 
Ordered Pair Axiom, 132 
proof, 137 
Ordered Pair Theorem, 137 
ordered ring, 351 
ordered triple, 139 
ordering 
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algebraic properties, 369 
on Q, 314 
on R, 349 
relation, 301 
ordinary induction proof template, 161 


origin, 134 
output, 211 
pair 


ordered, 132 
unordered, 125 
Pair Axiom, 152 
pairwise disjoint sets, 115, 147, 316 
paradox, 1 
partial order, 301 
strict, 307 
piecewise function, 251 
pitfall, 342 
pointwise operations on functions, 


240, 242 
polynomial, 241 
poset, 301 


concatenation, 303 
lexicographic order, 307 
product construction, 303 
subposet of, 302 
well-ordered, 304, 345 
positive, 370 
positive real numbers 
axioms for, 374 
properties, 370 
postulate, 48 
power of a function, 252 
power set, 127, 334 
formal construction, 152 
properties of, 128 
Power Set Axiom, 152 
Power Set Rule, 316 
precedence rules, 6 
preimage, 254 
properties, 256 
prime, 179 
divisibility property, 193 
infinitely many, 179 
infinitely many = 3 (mod 4), 184 
unique factorization, 193 
prime factorization 
consequences of, 197 
existence, 179 
of gcds, 198 
of rational numbers, 199 
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product, 349 
notation, 160 
of countable sets, 329 
pointwise, 240 
poset, 303 
Product Rule, 316 
product set, 133, 152 
proof template, 135 
properties, 134 
proof, 50 
by cases, 76 
by construction, 57 
by contradiction, 70 
by contrapositive, 63 
by example, 57 
by exhaustion, 82 
chain proof, 121 
circle proof, 119 
involving empty set, 115 
of 0 < 1, 370 
of necessity, 65 
of sufficiency, 65 
proof template, 56 
chain proof, 121 
circle proof, 119 
contrapositive proof, 63 
direct proof of IF-statement, 58 
for AND-statement, 64 
for backwards induction, 169 
for disproofs, 79 
for exclusive-OR statement, 72 
for existence statements, 57 
for function equality, 241 
for IFF-statements, 65 
for induction, 161 
for induction starting anywhere, 166 
for NOT-statement, 79 
for OR-statement, 72 
for set equality, 113 
for strong induction, 173 
for subsets, 106 
for uniqueness, 88 
for universal statements, 82 
product set as a subset, 135 
proof by cases, 76 
proof by contradiction, 70 
review, 99 
to prove Vn € Z, P(n), 169 
proper subset, 105 
proper superset, 105 
proposition, 1 
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propositional form, 2 

formal definition, 5 

logical equivalence, 9 
propositional logic, 1, 98 
propositional operators and quantifiers, 90 
propositional variable, 2 
provided, 19 
proving an AND-statement, 64 
proving an exclusive-OR statement, 72 
proving an IFF-statement, 65 
proving an OR statement, 72 
proving function equality, 241 


quantified variable, 83 
quantifier, 27, 98 
conversion rules, 32 
hidden, 33 
interaction rules, 90 
mixed, 84 
multiple, 83 
negation rules, 34 
reordering rules, 83, 92 
restricted, 28 
translations, 32 
quotient, 180 
uniqueness of, 182 
quotient set, 295 


rational, 47 
dyadic, 62 
in lowest terms, 195 
rational numbers, 28 
are dense in R, 379 
closure properties, 365 
definition as subset of R, 365 
form countably infinite set, 325, 329 
formal construction, 311 
ordering of, 314 
prime factorization, 199 
reading list, 383 
real numbers, 28, 349 
Archimedean property, 378 
closed subset of, 148 
closure properties, 350 
distance between, 372 
open subset of, 148 
order properties, 368 
ordering relations, 349 
uncountability of, 333 
undefined terms, 349 
recursive definition, 5, 159 


Index 


recursively defined sequence, 168 
reflexivity, 13, 276, 279 
Regularity Axiom, 156 
relation, 211 

antisymmetric, 301 

closure properties, 226 

composition, 221 

definition review, 341 

digraph of, 275 

graph of, 212 

identity, 220 

image of set, 213 

inverse of, 219 

irreflexive, 307 

on a set X, 275 

partial order, 301 

properties of, 226 

reflexive, 276 

symmetric, 277 

theorem review, 342 

total order, 302 

transitive, 278 
remainder, 180 

uniqueness of, 182 
reordering of quantifiers, 83 
Replacement Axiom, 154 
representative of equivalence class, 289 
restricted quantifier, 28 

conversion rules, 32 
restriction 

of a function, 248 

of an ordering, 302 
review 

of cardinality, 341, 344 

of functions, 340, 343 

of logic and proofs, 97 

of relations, 341, 342 

of sets and integers, 203 
Right Cancellation Law, 274, 352 
right inverse, 271 
ring, 351 
root of real number, 200 
row-reduction of a matrix, 188 
Russell’s Paradox, 149 


Schröder-Bernstein Theorem, 336 
semantic concept, 51 
sequence 

finite, 331 

infinite, 332 

recursively defined, 168 
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set 
€-transitive, 131 
axiomatic development of, 149 
bounded above, 351, 376 
bounded below, 376 
closed set, 148 
closed under operations, 338 
countable, 328 
countably infinite, 323 
difference, 151 
disjoint, 115, 316 
equality, 113 
informal definition, 103 
intersection, 151 
open set, 148 
pairwise disjoint, 115, 147 
power set of, 127 
product, 133, 152 
proof advice, 123 
set-builder notation, 144 
uncountable, 332 
with n elements, 315 
with at most 3 elements, 125 
ZFC axioms, 156 
set difference, 104, 105 
formal construction, 151 
set equality, 113 
proof template, 113 
properties, 113 
set operations, 103 
set partition, 295, 342 
and equivalence relations, 298, 345 
induced by equivalence relation, 295 
set theory 
axiomatic, 383 
informal introduction, 103 
review, 203 
set-builder notation, 144 
Set-Theoretic Induction Principle, 363 
seven, 350 
sign function, 78, 283 
sign rules, 357 
singleton set, 125 
six, 350 
small set, 125 
Specification Axiom, 151 
square, 358 
sum of, 162 
square roots in R, 379 
strict partial order, 307 
strictly negative, 370 
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strictly positive, 370 
strong induction, 173 
strong induction hypothesis, 174 
Strong Induction Proof Template, 173 
Strong Induction Theorem, 173 
subposet, 302 
subset, 104 
and power sets, 127 
characterization of, 120 
definition, 105 
of Z is countable, 328 
proof involving product set, 135 
proof template, 106 
properties of, 106 
substitution property for sets, 114 
substitution rules, 14 
sufficient, 18 
sum, 349 
additivity, 164 
of cubes, 164 
of odd integers, 162 
of squares, 162 
pointwise, 240 
telescoping, 170 
Sum Rule, 316 
summation index, 160 
summation notation, 160 
superset, 105 
supremum of a set, 377 
surjection, 260, 261 
composition of, 264 
surjective function, 261 
symmetric difference, 124 
symmetry, 13, 277, 279 
syntactic concept, 51 


T (true), 2 

tautology, 21, 48 

telescoping sum, 170 

ten, 350 

term, 51, 332 

theorem, 50 

Theorem on Absolute Value, 371 

Theorem on Addition, 352 

Theorem on Addition and Multiplication 
mod n, 310 

Theorem on Algebraic Operations in Q, 
312 

Theorem on Algebraic Properties of Order, 
369 

Theorem on Archimedean Property of R, 
378 


Index 


Theorem on Cardinal Equality and Order, 
322 

Theorem on Characteristic Functions, 243 

Theorem on Closure for Relations, 226 

Theorem on Closure Properties of Nonzero 
Real Numbers, 356 

Theorem on Congruence mod n, 284 

Theorem on Countability of Products, 325 

Theorem on Countable Sets, 329 

Theorem on Density of Q in R, 379 

Theorem on Direct Images, 255 

Theorem on Distance, 373 

Theorem on Distribution of Z in R, 379 

Theorem on Division, 358 

Theorem on Elements of Z/n, 308 

Theorem on Equivalence Classes, 291 

Theorem on Equivalence Relations and Set 
Partitions, 298 

Theorem on Existence of Prime 
Factorizations, 179 

Theorem on Existence of Real Square 
Roots, 379 

Theorem on Factorization of Divisors, 197 

Theorem on Finite Sets, 316 

Theorem on Fractions, 359 

Theorem on Function Composition, 247 

Theorem on General Unions and 
Intersections, 142 

Theorem on Greatest Common Divisors, 
185 

Theorem on Greatest Lower Bounds, 377 

Theorem on IF, 12 

Theorem on IFF, 20 

Theorem on Images, 215 

Theorem on Infinitude of Primes, 179 

Theorem on Injections, Surjections, and 
Bijections, 264 

Theorem on Inverses and Bijections, 269 

Theorem on Inverses and Composition, 268 

Theorem on Invertible Functions, 270 

Theorem on Left Inverses, 271 

Theorem on Logical Equivalence, 9 

Theorem on Multiplication, 358 

Theorem on Multiplication by Zero, 356 

Theorem on Nested Intervals, 380 

Theorem on Order Properties in R, 368 

Theorem on Pointwise Operations, 242 

Theorem on Positive Numbers, 370 

Theorem on Power Sets, 128 

Theorem on Preimages, 256 

Theorem on Prime Factorization of GCDs, 
198 


Index 


Theorem on Prime Factorization of 
Integers, 194 
Theorem on Product Sets, 134 
Theorem on Products of Countable Sets, 
329 
Theorem on Products of Integers, 197 
Theorem on Rationality of Roots, 200 
Theorem on Reflexivity, Symmetry, and 
Transitivity, 279 
Theorem on Relations, 226 
Theorem on Right Inverses, 272 
Theorem on Set Equality, 113 
Theorem on Signs, 357 
Theorem on Strong Completeness of Z, 378 
Theorem on Subsets, 106 
Theorem on Subtraction, 353 
Theorem on Sum of Odd Integers, 162 
Theorem on Sum of Squares, 162 
Theorem on Triangle Inequality, 372 
Theorem on Uncountability of Closed 
Intervals, 334 
Theorem on Uncountable Sets, 332 
Theorem on Uncountable Sets of Sequences, 
332 
Theorem on Uncountable Subsets of R, 333 
Theorem on Unions of Countable Sets, 330 
Theorem on Unique Prime Factorization of 
Rational Numbers, 199 
Theorem on Uniqueness of Prime 
Factorizations, 193 
Theorem on Well-Ordering of N, 376 
Theorem on Writing GCDs as Linear 
Combinations, 192 
Theorem on XOR, 21 
theorems requiring Axiom of Choice, 155 
three, 350 
total ordering, 302 
transitivity, 13, 122, 278, 279 
€-transitive set, 131 
of IFF, 24 
translation of quantifiers, 32 
translations of IFF, 20 
Triangle Inequality, 372 
trichotomy law, 368 
triple 
ordered, 139 
unordered, 125 
truth table, 2, 4 
two, 350 
two-part proof of IFF-statement, 65 
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uncountability of R, 333 
uncountability of intervals, 333, 334 
uncountable set, 332 
undefined concept in set theory, 149 
undefined term, 47, 349 
uniform continuity, 88 
union, 103, 105 

formal construction, 152 

of countable sets, 330 

of indexed collection of sets, 141 
Union Axiom, 152 
uniqueness, 41, 88 

of inverse functions, 269 

of prime factorizations, 193 

of quotient and remainder, 182 
uniqueness symbol, 41, 47 

how to eliminate, 43 
universal instantiation, 51 
universal quantifier, 27 
universal set, 28, 150 
universal statement about all integers, 169 
universe, 28 

does not exist, 151, 336 
unless, 19 
unordered pair, 125, 314 

formal construction, 152 
unordered triple, 125 
unrestricted quantifier, 28 
upper bound, 351, 376 
useful denial, 35 
using a known subset statement, 111 


value of function, 231 
variable, 27 

Venn diagram, 103 
vertical line test, 233 


weakly negative, 370 

weakly positive, 370 

well-defined function, 235, 250, 309, 339 
in Q, 312 

well-ordered poset, 304, 345 

well-ordering of Z>o, 165, 305, 376 


XOR, see exclusive-OR 


Zermelo—Fraenkel—Choice set theory, 149 
zero, 349 

divisor, 49, 356 

multiplication property, 356 
ZFC set theory, 149 
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